FIGURE l 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGGAGGTCTGAAGACTAACATTTTGTGAAGTTGTAAAACAGAAAACCTGTTAGAA ATG TGGTGGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
T CATACAT TACTGCAGTAACAC T CCACCATATAGAC CCGGC TTTACCT TATATCAG TGACAC TGG 
TACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 
CTACCATT TAT GT TCGT TATAAGCAAGT TCATGC TC TGAGTCC T GAAGAGAACG T TATCATCAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
H CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 

O TAT ATATGT TTGT TCAGACCATC CT TT CC TACCAAATGCAGCCCAAAAT CC ATGGCAAACAAG TC 

■r\ 

:!! TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 

Ul 

e s ; AT CAGT TT TGCACAGT GGCAAT TT TGGGACTGAT TTAGAACAGAAAC TC CATTGGAACCCCGAGG 

y| ACAAAGGTTATGTGCTTCACATGATCACTACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTT 

%l GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 

■k3 T GGAT TAACCCT CTATGACACTGCACCTTGCCC TAT TAACAATGAACGAACACGGC TACTT T CCA 

^ GAGATATTTGATGAAAGGATAAAATATTTCTGTAATGATTATGATTCTCAGGGATTGGGGAAAGG 

jjij TTCACAGAAGTTGCTTATTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACT 

H GAT GAAT GCTGATAAT CAGGAAACATGAAAGAAGCC ATTT GATAGAT TAT T C TAAAGGATAT CAT 

rij CAAGAAGACTATTAAAAACACCTATGCCTATACTTTTTTATCTCAGfiAAATAAAGTCAAAAGACT 
Q ATG 
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FIGURE 2 

<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T}: 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 
LCIATIYVRYKQVHALSPEEOTIIKLNKAGLVLGILSCLGLSIVANFQKTTLFAAHVSGAVLTFG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 
LLSRDI 

Important features: 

Type II transmembrane domain: 

amino acids 13-33 

Other Transmembrane domains: 

amino acids 54-73, 94-113, 160-180, 122-141 

N-myristoylation sites. 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE * 

CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGA.ee 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCTCCCAGATACTATTTTTTG 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTTTCAT GGTGCCTTT T TACATTGGCTAT TT TAT TGT GAGCAATATC CGACTACTGCATAAACAACGA 
y f CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
Q CATTCTCAGCCCAAAACATGGGAT CTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGT GATTGGAG 

□ TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 

CTCAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATATGAT 

Y:l CATAAGCAAAAAGAAAAGGATGGCAATGGGACGGAGAACAATGTTCCAGAAGGGGGAAGTGCATAACA 

If* 

f I AAC CATC AGGT TTCTGGGGAATGATAAAAAGTGTTAC CACTTCAGCATCAGGAAGTGAAAATCTTACT 

?*i CTTATTCAACAGGAAGTGGAT GCT TTGGAAGAAT TAAGCAGGCAGCT TTT T CTGGAAACAGCT GATCT 

TSttJt 

g ATATGCT ACCAAGGAGAGAATAGAATAC T CCAAAACC TTCAAGGGGAAATATT TTAATTT TCTT GGTT 

2| ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
Wl GGGAAAACGGATCCTGTCACAAGAGGCATTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGT 
^ GAAGTTTTGGTCCCAACACATTTCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGAT 
•JJ TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
Si CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 

AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAATGGCACCTTGAACTTAAGCCTACTACAGACTGTTAGAGGCCAGTGGTTTCAAAATTTA 
GATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAACAAACAAAATGCTATGGTAGC 
-A.TT TTT CACCT TCATAGCAT ACTC CTT CCCCGT CAGGTGATACTAT GACCATGAGTAGCATCAGCCAG 
AACATGAGAGGGAGAACTAACTCAAGACAATAC T CAGCAGAGAGCAT CCCGTGTGGATAT GAGGCTGG 
TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 
CTATGGTAGCTGAGCCAAACACGTAGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTG 
CCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE ± 

MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVIFSVTFAFSCTMFSLIIFEILGV 
LNSSSRYFHWKMNLCTILLILVEMVPmGY 

FPI LS PKHGI LS IEQL I SRVGVI GVTLMALLSGFGAVNCPYT YMS YFLRNVTDTDI LALERRLLQ 
TMDMIISKKKRMftMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQEVDALEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI 
TVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY 
FVSSVLL IRMSMPLEYRT I ITEVLGELQFNFYHRWFDVI FLVSALSS I LFLYLAHKQAPEKQMAP 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 
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FIGURE s 



AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AG AAC T C T C CAT CC G G AC TAG T TAT T GAGCAT CTGCCTCT CAT AT C AC C AG T GG CC AT C T GAGG T 
GTTTCCCTGGCTCTGAAGGGGTAGGCACG ATGG CCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCATGCAGAATTATGGGGATCACCCT TGTGAGCAAAAAGGCGAACCAGCAGCTGAAT T TCACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGT TT GCAGCCTAT TGTTACAACTCAT CTGAT AC T TGGACT AACT CGTGCAT TCCAGAA 
AT TATCACCACCAAAGATCCCATAT TCAAC^CTCAAACTGCAACACAAACAACAGAAT T TAT TGT 
U- CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
^ CTGCTCCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG 
%1 GAAAC TAGCACCATG T CTACAGAAACT GAACCAT TTG T TGAAAATAAAGCAGCAT TCAAGAATGA 

AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
CP CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
U| CAGCAGAAGGAAATGATCGAAACCAAAGTAGTAAAGGAGGAGAAGGCCAATGATAGCAACCCTAA 
\k T GAGGAAT CAAAGAAAACTGATAAAAACCCAGAAGAGT CCAAGAG TCCAAGCAAAACTACCGTGC 

f ! GAT GCC TGGAAGCT GAAGT T TAGA T GAGACAGAAAT GAGGAGACACACC TGAGGCT GGT T TC TT T 

CAT GCTCC TTACCCTGCCC CAGC TGGGGAAATCAAAAGGGC CAAAGAAC CAAAGAAGAAAG T CCA 
Q CCCT T GGT TCC T AAC T GGAATCAGCTCAGGACTGCCATTGGACTATGGAGT GCACCAAAGAGAAT 

15 GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
f'\ TTCTAGCC TGGCTATGTCC TAATAATATCCCACTGGGAGAAAGGAGT TTTGCAAAGTGCAAGGAC 

f% CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 

AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
Q GCTCTGAAAGAGAAACACG TATC CC AC CTGACATGT CCT T C TGAGCC CGGTAAGAGCAAAAGAAT 

ft! GGCAGAAAAGTTTAGCCCCTGAAAGCCATGGAGATTCTCATAACTTGAGACCTAATCTCTGTAAA 
p GC T AAAATAAAGAAAT AGAACAAGGC T GAGGATACGACAGTACAC TGTCAGCAGGGAC TGTAAAC 

1$] ACAGACAGGGT CAAAGTGT T TTCTC TGAACACAT TGAGT T GGAATCAC T G T T TAGAACACACACA 

! 14 CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA 

AAAATAAAAAC TCT TATAAAT T T CTAT TT T TAT CTGAGT T ACAGAAATGAT TAC TAAGGAAGAT T 
ACTCAGTAATTTGTTTAAAAAGTAATAAAATTCAACAAACATTTGCTGAATAGCTACTATATGTC 
AAGTGCT G TGCAAGGTAT TACACTC TGTAATTGAAT AT TAT TCC TCAAAAAAT TGCACATAGTAG 
AAC GCTATC TGGGAAGCTAT T T TT T TCAGT TT TGAT ATT T C TAGCT TATC TAC TT CCAAAC TAAT 
T T T TATT T T TG C T GAGAC T AAT C T TAT T CAT T T T CT C T AAT AT GG CAAC CAT TAT AAC C T TAAT T 
TAT TAT TAAC AT AC C TAAGAAG T ACAT TGT TAC C TC TAT AT AC CAAAG C AC AT T T T AAAAG T G C C 
ATT AACAAATG T AT CAC TAGCCCTCC TT TT TCCAACAAGAAGGGAC T GAGAGATGCAGAAAT ATT 
TGT GACAAAAAATTAAAGCAT T TAGAAAAC TT 
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FIGURE 6 

MARCFSLVLLLTS IWTTRLLVQGSLRAEELS I QVSCRIMGI TLVSKKANQQLNFTEAKEACRLLG 
LSLAGKDQVKTALKASFETCSYGWGDGFWISRISPNPKCGKNGVGVLIWKVPVSRQFAAYCYN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKLICVTEVmETSTMSTETEPFVENKAAFKNEi^GFGGVPTALLVLALLFFGAAAGLGFCYVK 
RYVKAFPFTNKNQQKEMI E TKWKEEKANDSNPNEESKKTDKNPEESKS PSKT TVRCLEAEV 

Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site. 

amino acids 79-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGGACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCGCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAATTGCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCTGCTA 
AAGCATCATCAGAAGTGAACCTGGCAAACTTACC TCCCAGC TATCACAATGAGACCAACACAGAC 
ACGAAGGT T GGAAATAATACCAT CC AT G T GCACC GAGAAAT T C ACAAGATAACCAACAAC CAGAC 
T GGACAAATGGT CT TT TCAGAGACAGTTATCACATC TGTGGGAGACGAAGAAGGCAGAAGGAGCC 
{ ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
* ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
f GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
J ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
1 ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
j CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
| CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
j GAGATCC TGCTGCCCAGAGAGGT CC CCGATGAGTATGAAGT TGGCAGC TTCATGGAGGAGGT GCG 

| CC^LGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
■■ CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA 
TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
I CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
! CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
j GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 
TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 
CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAG TCACACAGCTAGTGAAGACCAGAGCAGT T TCATCTGG T TGTGACTC TAAGCTCAG T GCTCT 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
T TT TCTTGAGGCAT GCACATC T GGAAT TAAGG TCAAACTAAT TC TCACATC CCTCTAAAAG T AAA 
CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 
AT T GACAC T G T C CC T C T T T GG CAG T TGCAT TAGTAAC T T T GAAAGGT AT AT GAC T GAGC GTAG C A 
TACAGGT TAACCTGCAGAAACAGTACTTAGGTAATTGTAGGGCGAGGATTATAAATGAAATT TGC 
AAAATCACT TAGCAGCAAC TGAAGACAAT TATCAACCACGTGGAGAAAATCAAACCGAGCAGGGC 
T GT G TGAAACAT GGT T GTAATAT GCGACTGCGAACACTGAACTCTACGCCACT CCACAAATGAT G 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CAT GATTGTATAAGCATGCTTTCTT TGAGT TTTAAAT TATGTATAAACATAAGTTGCATTTAGAA 
ATCAAGCATAAATCACT TCAAC T GCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 



MQRLGATLLCLLIAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEEIiMEDTQHKL 
RSAVEEMEAEEAAAKAS SEVNLMLPPSYHNETNTDTKVGNNTIHVHRE IHKITNNQTGQMVFSE 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYYCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE 
RS LTEEMALGEPAAAAAALLGGEE I 

Signal sequence: 

amino acids 1-19 

N-glycosylation site. 

amino acids 96-100,. 106-110, 121-125, 204-208 
Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 

N-myristoylation site. 

amino acids 202-208, 217-223 

Amidation site. 

amino acids 140-144 
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FIGURE Q 



CGGA CGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
C CACAAGC CCCAGGAC AT C CC T ACC T T CCC C T TAAT CTCTGCCTT CAT C CGCACAC TC C GT T AC C 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGCATCATGTGCTGTTT 
l CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 
J TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 

I ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 

1 

J GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
I ACT TTAAGAGCCCCCACCTCAACTAT TACT GGCTGCCCAT CAT GACCTCCATCCTGGGGGCCTAT 

j GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
I GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 

AGAT T C T GGGCAAGAAGAACGAGGC GC CCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTCCGG 
j CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
j CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
J T TGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTT CGAGACCAGCCTGGCCAACATGGT G 

1 AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
J GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 

GAT CGCGCCAC T GCACTCCAACCTGGGTGACAGACT C TG T C TCCAAAACAAAACAAACAAACAAA 

AAGAT T T TAT T AAAG ATAT T T T G T T AAC T C 
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FIGURE 10 



RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
™VLALGQCVLAGAFASFYWAFHKPQDIPTFPLISAFIRTLRYHTGSLAFGALILTLVQIARVIL 
E Y I DHKLRGVQNPVARC IMCC FKCCLWCLEKFI KFLNRNAY IMIAI YGKNFCVSAKNAFMLLMRN 
IVRVWLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 



c! 
m 



■Pi! 



rsi 



Important features: 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 

N-glycosylation sites . 

amino acids 16-20, 27-31, 289-293 



Hypothetical YBR002c family proteins. 

amino acids 276-288 



Ammonium transporters proteins. 

amino acids 204-231 

N-myristoylation sites . 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 
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FIGURE 11 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATCGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 
CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 
GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 
TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 
GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 
ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACT TCGTGCC TTACT GAGTCTCT AAGAC TTTT TCTAATAAACAAGC CAGT GCGTGTAAAAAAAA 
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FIGURE 12 



MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 
YKLPWCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 
NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVS IAAVLPKV 
QDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETWAGPEGYETQWWDAPSI 
VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 
S FFHFCLVLAS LHVMMTLTNWYKPGE TRKMI S T WTAVWVK I CAS WAGLLLYLWTLVAPLLLRNRD 
FS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains : 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 



CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGT TGGAAAAAGACTC CT G TAAC CCTC CTCCAGGATGAACCACCT GCCAGAAGACATGGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGCCAGGATTGAGTCCTATGAAGGAAGGGAAAAGAAAGGCATATCTGATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
T GAATGGAGGCAT T GAGAACAGAT T AGAGAAGGAGGTGAT GCAGTAT GACT AC TAT TCT T CATAT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
CTTGCCTGGATTGAGACGTGGTTCCTGGATTTCAAAGTGTTACCTCAAGAAGCAGAAGAAGAAAA 
CAGACTCC TGATAGT T CAGGAT GC T TCAGAGAGGGCAGCAC T TATAC C T GGTGGTC T T TCTGAT G 
GTCAGTTT TATTCCCCTCC TGAATCCGAAGCAGGATCTGAAGAAGCT GAAGAAAAACAGGACAGT 
GAGAAACCAC T T TT AGAAC TATGAGTACTACTT T TGT T AAATGT GAAAAACCCTCACAGAAAGT C 
ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC 
CACTGCTGGC T T TATT GAACAGCTAATAAAGAT T TAT T TATTG TAATAC CT CACAAAC G T TGTAC 
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 
AAAAGACT TAATATAT TGAAGTAACACT T T T T TAGTAAGCAAGATACC TT T T TAT T TCAAT T CAC 
AGAATGGAAT TTTTTTGTTT CAT GT CTCAGATT TAT TTTGTAT T TCT T TT TTAACACTCTACAT T 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
C T T GCAAAAGGGGAAGAAAGGAAT T GCGAATACATGTAAAATGTCACCAGACAT T TGTATTATT T 
T TATCATGAAAT CATGTT TT TC T CT GAT T GT TC T GAAATGT TC TAAATAC T C TTAT T T TGAATGC 
ACAAAATGACT TAAACCAT TCATAT CATGT T TCCTT T GCGT TCAGCCAAT T T CAAT TAAAAT GAA 
CTAAATTAAAAA 
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FIGURE 14 

MNHLPEDMENALTGSQS SHASLRNI HS INPTQLMARI E S YEGREKKGI S DVRRT FCLFVT FDLLF 
VTLLWI IELNVNGGIENTLEKEVMQYDYYS S YFD I FLLAVFRFKVL I LAYAVCRLRHWWAIALTT 
AVTSAFLLAKVILSKLFSQGAFGYVLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AAL I PGGLSDGQFYS PPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 



si Transmembrane domains: 

01 

i'l amino acids 54-12, 100-113, 130-144, 146-166 



N-myristoylation sites, 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 



; ii; 

111 
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FIGURE « 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCGCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
C GAGAC GT GCCACCCGACACGGT GGGGCTGTACGT CT T TGAGAACGGCATCACCATGC TCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCAGGACC 
TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCGGTGGTGCGGGAGCCCACAGCCTTGTCT 
TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG 
GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
GCGAGGAGGCC T G CGGGGAGGCC CATACACCCCCAGCC GT CCAC TCCAACCACGCCCCAGTCACC 
CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 
GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGGCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCGTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGCTGTGTGACTCTAGTCTTGGCCCCAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGATGC 
T T T AGGAACAT GT TTTGCTTTTT T AAAATAT AT AT AT T T ATAAGAGAT C C T T T C CC AT T T AT T C T 
GGGAAGATGTT T T T CAAAC TCAGAGACAAGGACT T T GGT TT T TGTAAGACAAACGATGAT AT GAA 
GGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 
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FIGURE 16 

MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTANVE 
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLSWFGPWVRESHVTLASPEE 
TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAV 
LLAALAAVGAAYCVRRGRAMAAAAQDKGQVGPGAGP LE LE GVKVPLE PGPKATEGGGEALP S G S E 



I , I CEVPLMGFPGPGLQS PLHAKP Y I 

''H Important features: 

WJ Signal peptide: 

%-,>, amino acids 1-23 

If! Transmembrane domain: 

I amino acids 579-599 

EGF-like domain cysteine pattern signature, 
amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 

GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATGCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGATGAATCCT TAGATTCCAAGAC TACT T TGACATCAGATGAGTCAGTA 
AAGGACCATACTACTGCAGGCAGAGTAGT TGCTGGTCAAATATTTCT TGATTCAGAAGAATCTGA 
AT T AGAAT CCTC TAT T CAAGAAGAGGAAGACAGCCT CAAGAGCCAAGAGGGGGAAAGTGTCACAG 
AAGATATCAGCT T TCTAGAGTCTCCAAATCCAGAAAACAAGGACTATGAAGAGCCAAAGAAAGTA 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
T T T CC TAGATAAGGAG T AT GAT GAAT GT AC ATCAGAT G GGAGGGAAGAT GGCAGAC TG T GGT GT G 
C TACAAC C TAT GACTAC AAAGCAGATGAAAAG TGGGGCTTT T GT GAAAC T GAAGAAGAG GC T G C T 
AAGAGACGGCAGAT GCAGGAAGCAGAAATGATGTAT CAAAC TGGAAT GAAAATCCT TAATGGAAG 
CAATAAGAAAAGCCAAAAAAGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA 
CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATCCAG 
GCAG C GAGAGAGAT G T T T GAGAAGC T GAC T GAGGAAG GC T C TC C CAAGGGACAGAC T G C T C T T GG 
CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
TTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTT TAG TGGAAGGCT 
AATAATAT TAACAT CAGAAGAAT TTGTGGTT TATAGCGGC CACAACT TT TT CAGC T T TCATGAT C 
CAGAT TTGCTT GTATT AAGACCAAATAT TCAG T TGAAC TT CC T T CAAAT TC T TG T TAATGGATAT 
AACACATGGAAT CTACATG TAAATGAAAGT TGGTGGAGTCCACAAT T T T T C TT TAAAATGAT TAG 
TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 
GAAT TGTCAGAATCAT TT TT TACAT TAGATTATCATAAT T T TAAAAAT TT T TCT T TAG T TT TTCA 
AAAT T T TGTAAATGGTGGC TATAGAAAAACAACATGAAATATTATACAATAT TT TGCAACAATGC 
CCTAAGAAT TGT TAAAAT TCATGGAGT TAT TTGTGCAGAATGAC TCCAGAGAGC T C TAC TT T CT G 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT TAACTC^TT T TTAATAAAATTATGTCTAAGATTAAAAAAAARAAAAAAAAAAAAAAAA7^A 
AAAAAAAAAAAAAAAAAAAAA^y\AAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 18 



MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESS IQEEEDSLKSQEGESVTEDI SFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAK^^ 

EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
SSQAKALVYYTFGALGGNLIAHMVLVSRL 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myr i s toylation si tes . 

amino acids 120-126, 253-259, 268-274 , 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C-terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE 1Q 

AATTCAGATTTTAAGCCCAT TCTGCAGTGGAAT T TCATGAACTAGCAAGAGGACACCATCTTCT T 
GTAT TATACAAGAAAGGAGTGTACC TATCACACACAGGGGGAAAAATGC TCT TT T GGG TGCTAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGTACAT T T T TAT CACTGGATGTGACTCGGGCT T T GGAAACT TGG CAGCCAGAACT TT T GAT 
AAAAAGGGATTTCATGTAATCGCTGCCTGTCTGACTGAATCAGGATCAACAGCTTTAAAGGCAGA 
AACC T CAGAGAGACT T CGTACTGTGCTTC TGGATGT GACCGACCCAGAGAAT GTCAAGAGGACT G 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
C CCGGCGTGC TGGCTC CCACTGACT GGCTGACAC TAGAGGACTACAGAGAACCTAT TGAAG T GAA 
CCTGTTTGGACTCATCAGTGTGACACTAAATATGCTTCCTTTGGTCAAGAAAGCTCAAGGGAGAG 
TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC 
ATGCAT TGAACCAGGATTG TTCAAAACAAACT T GGCAGAT CCAGTAAAGG TAAT T GAAAAAAAAC 
T CGCCAT T TGGGAGCAGCTGT C T CCAGACATCAAACAACAATATGGAGAAGGTTACAT TGAAAAA 
AGT C TAGACAAAC T GAAAGGCAATAAATCC TAT GTGAACATGGACCT C T C T CCGGTGG TAGAGT G 
CAT GGACCACGC TCTAACAAGTC TCT T CCC TAAGACT CAT TATGCCGCT GGAAAAGAT GCCAAAA 
TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 
GCAGAGC T GGCT AATC CCAAGGCAGTGTGACTCAGC TAAC CACAAAT GT CT CCTCCAGGCTATGA 
AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 
CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 
CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTAT T TAGGC T T TGCCTGCT TGGTGT GATGTAAGGGAAAT TGAAAGAC T T GCCCATT CAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
ATCATCTCTTATCTAAATATTAAAAGATAAGTCAACCCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 
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FIGURE 20 

MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
STALKAE T SERLRTVLLDVTDPENVKRTAQWVKNQVGEKGLWGL INNAGVPGVLAPTDWLTLEDY 
REPIEWLFGLISVTLMLPLVKKAQGRVINVSSVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVSCIEPGLFKTNIJU3PVKVIEKKIAIWEQL^ 

LSPWECMDHALTSLFPKTHYAAGKDAKI FWI PLSHMPAALQDFLLLKQKAELANPKAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites . 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 

CTGAGGCGGCGGTAGCATCGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GG T GAAGCCAAGAACAGCAT TACTGATTCCCAAATGGATGATGTTGAAGTT G T T TATACAAT TGA 
CATTCAGAAATATAT TCCATGCTATCAGCT TTTTAGCT TTTATAATT CTTCAGGCGAAGTAAATG 
AGCAAGCACTGAAGAAAATATTATCAAATGTCAAAAAGAATGTGGTAGGTTGGTACAAATTCCGT 
CGT CAT TCAGAT CAGATCATGACG T T TAGAGAGAGGC T GC T TCACAAAAACT TGCAGGAGCATTT 
T TCAAACCAAGACCT T G T T T T TCT GC TAT TAACACCAAGT ATAATAACAGAAAGCTGC TCTACT C 
ATCGACTGGAACATTCCTTATATAAACCTCAAAAAGGACTTTTTCACAGGGTACCTTTAGTGGTT 
GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
T TT TAGCCGAGCAGTACAAACACACAGCTC TAAATT T T TT GAAGAAGAT GGATCCT TAAAGGAGG 
TACATAAGATAAATGAAAT GTAT GCT T CAT TACAAGAGGAAT TAAAGAGTATATGCAAAAAAGTG 
GAAGACAG TGAACAAGCAG TAGATAAACTAGTAAAGGATGTAAACAGAT TAAAACGAGAAAT TGA 
GAAAAGGAGAGGAGCACAGAT TCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCC TCAGGAGA 
ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
AT GTCT T TAAAAAATAGACATGT T T CT AAAAGTAGCTGTAACTACAACCAC CATC TCGAT GT AGT 
AGACAATC T GAC C T TAAT G GTAGAACACAC T GAC AT T C C T GAAGC TAG T CC AGC TAG TAC AC CAC 
AAATCAT TAAGCATAAAGCCTTAGACT TAGATGACAGATGGCAAT TCAAGAGATC T CGGT TGTT A 
GATACACAAGACAAAC GATCTAAAGCAAATAC TGG TAGTAG TAACCAAGATAAAGCAT CCAAAAT 
GAGCAGCCCAGAAACAGATGAAGAAAT TGAAAAGATGAAGGGTT T TGGT GAATAT TCACGGT CTC 
CTACATTT TGA TCCTTTTAACCTTACAAGGAGATTTTTTTATTTGGCTGATGGGTAAAGCCAAAC 
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
TGTT TGCAGTAATACACAGATAACT CT TAGTGCAT T TACT T CACAAAG TACT TT TT CAAACATCA 
GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
ACACAT TCT TTAGAAT TGGAAAAGT GAGACCAGGCACAGT GGCTCACAC CT G TAATCCCAGC AC T 
TAGGGAAGACAAG TCAGGAGGAT T GAT T GAAGC TAGGAGT TAGAGACCAGCC TGGGCAACGT ATT 
GAGACCATGTCTATTAAAAAATAAAATGGAAAAGCAAGAATAGCC TT AT TT TCAAAATAT GGAAA 
GAAATT TATATGAAAATTTATCTGAG TCATTAAAATTCTCCTTAAGT GATACTTT T TTAGAAGTA 
CATTATGGCTAGAGT TGCCAGATAAAATGCTGGATATCATGCAATAAAT TT GCAAAACATCATCT 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 



MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PCYQLFSFYNSSGEWEQALKKILSNVKK3WVGWYKFR 

VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
QIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSSCNYNHHLDWDNLTL 
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEE I EKMKGFGE YSRS PT F 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation sites. 

amino acids 75-79, 322-32 6 

N-myristoylation site. 

amino acids 184-154 

Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 23 



GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCAGCGCAGGGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
, , AACAAGAACGATGCCC CCCT CCACGAGAT CAAC GGT GATCATT T GAAGAT CTGT CCCCAGGGT T CTAC CTGCTGCT CT 

^ CAAGAGAT GGAGGAGAAGTACAGCCT GCAAAGTAAAGATGATT T CAAAAGT GTGGTCAGCGAACAGTGCAATCATTTG 

Zl CAAG CT GT CTTTGCTT CAC GTTACAAGAAGTTT GAT GAATT CT T CAAAGAACTACTTGAAAAT GCAGAGAAAT CCCTG 

pj» AATGATAT GT TTGTGAAGACATATGGCCATTTAT ACATGCAAAAT TCT GAGCTATTTAAAGATCTCTT CGT AGAGTTG 

'f I AAACGTTACTACGTGGTGGGAAATGTGAACCTGGAAGAAATGCTAAATGACTTCTGGGCTCGCCTCCTGGAGCGGATG 

TT CC GCCT GGTGAACT CCCAGT ACCACTTTACAGATGAGT ATCT G GAAT GT GTGAGCAAGT ATACGGAGCAGCTGAAG 
u : CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 

TTAG C GGTT GCGGGAGATGTCGT GAG CAAGGTCT C CGTGGTAAACCCCACAGCC CAGT GTACCCATGCCCT GT TGAAG 
ATGAT CTACTGCTCCCACTGCCGGGGTCT CGTGACT GTGAAGCCATGTTACAACTACT GCT CAAACAT CAT GAGAGGC 
fi?$ T GT TT GGC CAACCAAGGGGATCT CGATTTT GAAT GGAACAATTT CATAGAT GCTATG CT GAT GGT GGCAGAGAGGCTA 

f l\ GAGG GTCCT TTCAACATTGAAT C GGT CAT GGAT C CCAT CGATGT GAAGATT TCT GAT GCTATTAT GAACAT GCAGGAT 

^ AATAGTGTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACGAATTTCTCGT 
£J| . TCCATCTCTGAAAGTGCCTTC&GTGCTCGCTTCAGACCACATCA 

Ui[ AGTTTGGACCGACTGGTTACTGATGTCAAGGAGAAACTGAAACAGGCCAAGAAATTCTGGTCCTCCCTTCCGAGCAAC 
^5 S GTTT GCAAC GAT GAGAGGATGGCTGCAGGAAAC GGCAATGAGGATGACTGT TGGAAT GGGAAAG GCAAAAGCAGGTAC 

in 

CT GTT TGCAGTGACAGGAAAT GGATT AGCCAAC CAGGGCAACAACCCAGAGGTCCAGGT TGACACCAGCAAACCAGAC 
AT ACT GAT C CTTCGT CAAATCAT GGCTCT T C GAGT GATGACCAGCAAGAT GAAGAAT GCAT ACAAT GGGAAC GACGTG 
GACTT CTTT GAT AT CAGT GAT GAAAGTAGT GGAGAAGGAAGTGGAAGT GGCTGT GAGT ATCAGCAGTGCCCTTCAGAG 
TTTGACTACAATGCCACTGACCATGCTGGGAAGAGTGCCAATGAGAAAGCCGACAGTGCTGGTGTCCGTCCTGGGGCA 
CAGGC CTACCTCCT CACT GTCTT CT GCAT CTTGTTCCT G GTTAT GCAGAGAGAGT G GAGATAAT T CTCAAACTCT GAG 
AAAAAG TGTT CATCAAAAAGTTAAAAGGCACCAGTTATCACTT T TCT ACCATCCT AGT GACTT T GCTTTTTAAATGAA 
T GGACAAC AAT GTACAGTTTTTACTATGT GGCCACT GGTT T AAGAAGT GCT GACT TTGTTTT CT C ATT CAGTTTT GGG 
AGGAAAAGGGACTGT GCATTGAGTTGGTT CCTGCTC CCCCAAAC CATGTTAAACGTGGCTAACAGTGT AGGTACAGAA 
CT ATAGTTAGTT GTGCATTT GT GATTTTAT CACT CT ATTAT TT GT TT GTAT GTT TTTT TCT CATTT CGTT T GTGGGTT 
TTT TT TTCCAACTGT GATCT CGCCTT GTTT CTTACAAGCAAACCAGGGTC C CTT CTT GGCACGTAACAT GTACGTATT 
TCT GAAATATTAAAT AGCTGTACAGAAGCAGGTT TT ATTT ATCAT GTTAT CTTAT TAAAAGAAAAAGCCCAAAAAGC 
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FIGURE 2d 



MARFGL PALLCT LAVLS AALLAAELKSKSC S EVRRLYVSKGFNKNDAPLHE I NGDHLKI CPQGS T 
CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 
LYMQNSELFKDLFVELKRYYWGNVNLEEMLN^ 

QLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDWSKVSVWPTAQCTHALLKMIYCSHCRGL 
VTVKPCYNYCSNIMRGCIJU^QGDLDFEWNNFIDAMLMVAERLEGPFNIESVMDPIDVK 
MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWS S LPS NVCNDERMAAGNGNEDDCWNGKGKS RYL FAVT GNGLANQGNNPE VQVDTS 
KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 
KS ANEKADSAGVRPGAQAYLLT VFC I L FLVMQREWR 

Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

GlycosaminoglYcan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 



CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
C TGAGTCAT CCC CAGGGATCAGGAGCCTCCAGCAGGGAAC CT TCCAT TAT AT TC T TCAAGCAAC T 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
T CAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCAC CACAGAAAGCCAAACAAGCAT T CCAGA 
, ik GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 
'*) &GCGCCCACTCTTCCAAT TAAACAT TCTCAGCCAAGAAGACAGTGAGCACACCTACCAGACACTC 

11 TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 



ul 



3 



f s 1 



TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGCTAGTGTCAT TT AACC TTAAATGCAATCAGGAAAG TAGCAAACAGAAGT CAATAAAT AT T T T T 
AAATGTCAAAAAAAAAAAAAAAAAA 
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FIGURE 26 



MKVLISSLLLLLPLMLMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites. 

amino acids 27-33, 46-52 



App_ID=10063570 



Page 176 



FIGURE 27 



GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCC ATGG ACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCGCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCTGACTCCCAAGAGCAACGGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 
ATAAAGGGGCTTAGAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
C TCTGTGCAGAGCCCAAGGAAGGT CC T GCAGGAGG T CCGGAGAGTAC TGAGACCGGGAGGT GTGC 
T CTT T TTC TGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCT TCATGT GGCAGCAAGT TT T C 
GAGCCCACCTGGAAACACATTGGGGATGGCTGCTGCCTCACCAGAGAGACCTGGAAGGATCTTGA 
GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
T TCCCCAGCC T CCAAT TAGAACAAGCC ACCCACCAGCCT ATCTAT CT TC CAC TGAGAGGGACC TA 
GCAGAATGAGAGAAGACAT TCAT GT ACCACCTACTAGT CCC TC T C TCCC CAACC T C TGCCAGGGC 
AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTAGGCTGACCCAGGGAGG 
AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
ATGCCAGAGCAAGACT CAAAGAGGCAGAGGT TT TGTTCTCAAATATT TTTTAATAAATAGACGAA 
ACCACG 
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FIGURE 28 



MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYIJyiAVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 
RQLADGSMDWVCT LVLCSVQS PRKVXQE WRVLRPGGVLFFWEHVAE PYGSWAFMWQQVFEPTW 
KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLE QATHQ P I YLPLRGT 

Important features: 
Signal peptide: 



f-\ amino acids 1-23 



Leucine zipper pattern. 

amino acids 10-32 

N-myri s toylation si tes . 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 



C"1 



Ul 



CAATGTTTGCCTATCCACCTCCCCCM.GCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAG TGGGTGGGCTAGGGGGGCTGCCT TAT T T AAA 
GT GGT TGT TTAT GAT T CT TATACTAAT TTATACAAAGATATTAAGGCCCTGT TCAT TAAGAAAT T 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 



w 
m 
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FIGURE so 

MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 

Important features: 
Signal peptide: 

amino acids 1-15 

Growth factor and cytokines receptors family: 

amino acids 3-18 
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FIGURE ai 

GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
CCTCCAAGCAAGTCATTTCCCTTATTTAACGGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
AT T TGCATC TGT T T TGATAAATGAT G T TGACACCCT CCAC C GAAT TC TAAGTGGAAT CATGT CGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
T G T CTGCGGTG TTT TATGGTGGC TGTAT TATGAC TATACCAACGACCTCAGCATAGA&T TGGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
CTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATTGACAGTTGAGCTTTTCCAAATCAC 
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
AAT TGGCC TCAT CT GGACTAGTGAAT T CATCC T TGCG TGCCAGCAAAT GACTATAGC T GGGGCAG 
TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
GATT CCGAGAAT CAT TGTCATGTACAT GCAAAACGCACT GAAAGAACAGCAGCATGGTGCAT TGT 
CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
AACCAGAATGCATATAC TACAACTGC TAT TAAT GGGACAGATT TCTGTACAT CAGCAAAA.GATGC 
AT T CAAAATCT T GT CCAAGAACT CAAGTCACT T TACAT CTATTAACT GCT T TGGAGACT T CATAA 
TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
T GGAAACAAATGAT GGATCGTCAGAAAAGC CC TACT T TATGGATCAAGAAT T TCTGAGT TTCGTA 
AAAAGGAGCAACAAATTAAACAATGCAAGGGCACAGCAGGACAAGCAC TCAT TAAGGAAT GAGGA 
GGGAACAGAACTCCAGGCCATTGTGAGATAGATACCCATTTAGGTATCTGTACCTGGAAAACATT 
T CCT TCTAAGAGCCATT TACAGAATAGAAGATGAGACCACT AGAGAAAAGT TAGTGAAT T T T TTT 
T TAAAAGACCTAATAAACCCTATTCT TCCTCAAAA 
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FIGURE 32 

MSGRDTILGLCILAIALSLAMMFTFRFITTLLVHIFISLVILGLLFVCGVLWWLYYDYTNDLSIS 
LDTEREl^MKCVXGFAIVSTGITAVLLVLIFVLRKRIKLTVELFQITNKAISSAPFLLFQPLWTFA 
I L I FFWVLWVAVLLS LGTAGAAQVMEGGQVE YKPLSGI RYMWS YHL I GL I WT S E FI LACQQMT I A 
GAWTC YFNRSKNDPPDHP IL S SLS I L FFYHQGT WKGS FL I S WRI PRI IVMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FIIFLGKVLWCFTVFGGLMAFNYNRAFQWAVPLLLVAFFAYLVAHSFLSVFETVLDALFLCFA 
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 

Important features: 
Signal peptide: 

amino acids 1-2 0 

Putative transmembrane domains: 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 
N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 
N-myristoylation sites . 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 33 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAAT TTG T ATGGGAT TTGTGAT GCAGGAAAGCC TAAGGGAAAAAGAATAT TCAT TCTG TG TGGT 
GAAAATTTTTTGAAAAAAAAATTGCCTTCTTCAAACAAGGGTGTCATTCTGATATTTATGAGGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
ATTCAAACAAAGAAACGGCAAAGAAGATTAAAAGGCCCAAGTTCACTGTGCCTCAGATCAACTGC 
GATGTCAAAGCCGGAAAGATCATCGATCCTGAGTTCATTGTGAAATGTCCAGCAGGATGCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GGT TACAAAGGGAGT TAT TCCAACGGT GT CCAATCGT T AT CCC TACCACGATGGAGAGAATCCT T 
TAT CGTCT TAGAAAGT AAACCCAAAAAGGGTGTAACCT ACCCATCAGCT CT TACATAC TCAT CAT 
C GAAAAGTCCAGCT GCCCAAGCAGGTGAGACCACAAAAGCC TAT CAGAGGC CACCTAT TCCAGGG 
|«* ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
f'\ CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 
fi\ GCCACAGGAGCCAGGAGATGGAT CTCTGGTCCAC TGCCACC TACACAAGCAGCCAAAACAGGCCC 

5f AGAGCTGATCCAGGTATCCAAAGGCAAGATCC T T CAGGAGC TGCC TT CCAGAAACC TG TT GGAGC 

01 GGATGTCAGCC T GGGACT TGT TCCAAAAGAAGAATT GAGCACACAGT CTT T GGAGCCAGT AT CC C 

yj TGGGAGATCCAAACTGCAAAATTGACTTGTCGTTTTTAATTGATGGGAGCACCAGCATTGGCAAA 
\jj CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
r j CGGTCCACTGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACAC 
ACACGAAT TCTCGAGATCTGAAGACAGCCATAGAGAAAAT TAC T CAGAGAGGAGGAC T TT CT AAT 
W GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 
S GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
r*J GAC T TGCGAGAGAGTCAGGAAT C AACATT T TC T TCATCAC CAT TGAAGGTGCTGC TGAAAAT GAG 

j!5 AAG CAG TAT G T GG T G G AGC CC AAC T T T G CAAACAAGG C C GT GT GC AGAAC AAAC GGC T T C TAC T C 

Z; GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
Q ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
fit GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
Q AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC 
S| TGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACATCCTCAACGCCATCAAGAGGGTGGGC 
'■ J TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
GTC CAAGC CCAACAAGAGGAAGT TAATGATCC TCATCACC GACGGGAGGT CCTACGACGACGTCC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGCT AGAAGTCAT TGCCAC TCACCCCGCCAGAGACCAC TCCTTCTTTGT GGACGA 
GTT TGACAACC T CC AT CAGTAT GTCCCCAGGATCAT CCAGAACAT TTG T ACAGAGT T CAAC T CAC 
AGCC TCGGAAC TGAAT TCAGAGCAGGCAGAGCACCAGCAAGTGCTGC T T TACTAAC TGACG T G T T 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAATGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 
T GATCACAAACGTATAGAATGAGCCAAAAGGCTACATCATGTTGAGGGTGCTGGAGATT TTACAT 
TTT GACAATTGT TTTCAAAATAAAT GTTCGGAATACAGTGCAGCCCT TACGACAGGCTTACGTAG 
AGC T T T T G TGAGAT T T T TAAGT T GT TAT T TCTGATT TGAAC TC TGTAACCCTCAGCAAGT T T CAT 
TTTTGTCATGACAATGTAGGAATTGCTGAATTAAATGTTTAGAAGGATGAAAAATAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^AAAAAAAAG 
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FIGURE 34 

MRTVVLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 
CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 
ES FI VLESKPKKGVT YPSALT YS SSKSPAAQAGETTKAYQRPP I PGT TAQPVTLMQLLAVTVAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 
VGADVSLGLVPKEELSTQSLEPYSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI 
GPAGPLMGWQYGDMPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 
RSGAPNVYVVMVDGWPTDKYEEASRLARESGINIFFITIEGAAENEKQY^ 
FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSAblGFVIDGSSSVGTGNFRTVLQFVTN 
Q LTKEFE I SDTDTRI GAVQYT YEQRLE FG FDKYSSKPDI LNAIKRVGYWSGGTSTGAAINFALEQL 

Q FKKSKPNKRKLMI LIT DGRS YDDVRI PAMAAHLKGVT TYAI GVAWAAQEELEVIATHPARDHS FF 

Cfj VDEFDNLHQYVPRIIQNICTEFNSQPRN 

yj 

Z\ Important features: 

Q Signal peptide : 

3 amino acids 1-26 

\ f 1 

'%l Transmembrane domain: 

wl ' 

m amino acids 181-200 

? I? 
r s: i 

fit N-glycosylation sites . 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 
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FIGURE 3 5 



CCGAGC AC AGGAGATT GCCTGCGT TT AGGAGGTGGC TGCGT T GTGGGAAAAGCT ATCAAGGAAGAAAT T GC 
CAAACC ATGT CT T T TTTT CTGTTT T C AGAGTAGT T CAC AAC AGATCTGAGTGTT TTAATT AAGCATGG AAT 
ACAGAAAACAACAAAAAACTTAAGCT TT AATTTCATCT GGAATTCCACAGTTTTCTTAGCTCCCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTG GATGG CCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGAT GT CACTGAGATCCC TC AAAT GGAGCC TCCT GCTGCTGT CACTCCTGAGTT TCTTTGTGATGTGGTAC 
CTCAGC CTTCCCCACT ACAATGTGAT AGAACGCGTGAACTGG ATGT ACTT CT AT GAGTAT GAGCCG AT T T A 
CAGACAAGACTTTC ACTT CACACT TCGAGAGCATTC AAAC T GCT CTCATC AAAATCCATT TC TGGTCAT TC 
Ur TGGT GACCTCCCACCCTT C AGATGTGAAAGCC AGGC AGGCC ATT AGAGTT ACT T GGGGTGAAAAAAAGTCT 

f | TGGTGGGGATATGAGGTTCTTACATTTTTCTTATTAGGCCAAGAGGCTGAAAAGGAAGACAAAATGTTGGC 
f £{ | ATTGTCCTT AGAGGAT GAAC ACCT T CTT T ATGGTGACATAATCCGACAAGATT T TTTAGACAC AT ATAAT A 

yl ACCT GACCTTGAAAACCATT ATGGCATT C AGGTGGGT AACT G AGTT T T GC CCC AAT GCCAAGT ACGTAAT G 

ill AAGACAGACACTGATGTTTTCATCAATACTGGCAAT TTAGTGAAGT AT CTTTT AAACCTAAACC ACTCAGA 

m GAAGTTTTTCACAGGTTATCCTCTAATTGATAATTATTCCTATAGAGGATTTTACCAAAAAACCCATATTT • 

"'■■I CTTACC AGGAGT AT CCTT TCAAGGT GTT CCCTCC AT AC TGCAGTGGGT TGGGTT AT ATAATGT CCAGAGAT 

^ TTGGTGCCAAGGATCTATGAAATGATGGGTCACGTAAAACCCATCAAGTTTGAAGATGTTTATGTCGGGAT 
% .. CTGT TT GAATTTATTAAAAGTGAACATT CATATTCCAGAAGACACAAATCTTTTCTTTCTAT ATAGAATCC 

\ n ATTTGGATGTCTGTCAACTGAGACGTGTGATTGCAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTT 
{ « % T GGC AGGTCATGCTAAGGAACACC ACAT GCCATT AT TAACTTCACATT CT ACAAAAAG CC TAGAAGGACAG 

. ; S5 /f 

m GAT ACCTTGTGGAAAGTGTT AAAT AAAGTAGGT ACT GT GGAAAATTCAT G GGGAGGTC AGTGTGCT GGCT T 

f! ACACTGAACTGAAACT C ATGAAAAACCC AG ACTG GAGACT GG AGGGTT AC ACTT GT GATT TAT T AGTCAGG 

f|J CCCT TCAAAG ATGAT ATGTGGAGGAAT T AAAT AT AAAGGAATTGGAGGT T TT TGCT AAAGAAATT AAT AGG 

ACCAAACAATTTGGAC AT GTCATT CT GT AG ACT AGAAT T T CTTAAAAGGGTGTT ACTG AGTT AT AAGCTC A 
CT AGGC TGTAAAAACAAAACAATGTAGAGTTTT ATT TAT T GAACAATGTAGT C ACT TGAAGGTTT TGTGT A 
TATCTT ATGTGG AT TACC AATTTAAAAAT ATATGT AGT T C TGTGTC AAAAAACT TCTTCACT GAAGTT AT A 
CT GAAC AAAATTTT ACCT GTTTTT GGTC ATTT AT AAAGT ACTTCAAGAT GTT GC AGTATT TC AC AG TT ATT 
ATTATT T AAAATTACT TC AACTTT GTGT TTTT AAAT GT T T TG ACGATT T C AAT ACAAGAT AAAAAG GATAG 
TGAATC ATTCTT TACATGCAAACATTTT CC AGTT AC TT AACT GATCAGT T TATT ATTG AT AC ATC ACTCC A 
T TAATGT AAAGTCAT AGGT C ATTATTGC AT ATCAGT AAT CTCTTGG ACTT T GTT AAAT AT TT T ACTGTGGT 
AATATAGAGAAGAATT AAAGCAAGAAAATCTGAAAA 
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FIGURE 36 



MASALWTVLPSRMSLRSLKWSLLLLSLLSFFVMWYLSLPHYNVI ERVNWMYFYEYEPIYRQDFHF 
TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 
LSLEDEHLLYGDI IRQDFLDTYNNLTLKT IMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KPIKFEDVWGICLNLLKWIHIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLR 
NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site. 

amino acids 239-243 

Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



CGCTCGGGCACCAGCCGCGGCAaG GRTGG AGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
TCGTCCT TGCCAAGAGAGT ACACAGTCAT TAATGAAGCCTGCCCTGGAGCAGAGTGGAATATCATGTGTCGGGAGTGC TGTG 
AATATGAT CAGAT TGAGTGCGTC T GCCCCGGAAAGAGGGAAGTCGTGGGTTATACCATCCC TTGCTGCAGGAATGAGGAGAA 
TGAGTGTGACTCCTGCCTGATCCACCCAGGTTGTACCATCTTTGAAAACTGCAAGAGCTGCCGAAATGGCTCATGGGGGGGT 
ACC TTGGAT GAC T TC T ATGTGAAGGGGT TCTAC TGTGCAGAGTGC CGAGC AGGC T GGTACGGAGGAGAC TGCATGCGATGTG 
GCCAGGTTCTGCGAGCCCCAAAGGGTCAGATTTTGTTGGAAAGCTATCCCCTAAATGCTCACTGTGAATGGACCATTCATGC 
TAAAC CTGGGTT TGTCATCCAAC TAAGAT TTGTCATGTTGAGTCTGGAGTT T GAC TACATGTGCCAGTATGACTATGTTGAG 
GTT CGTGAT GGAGACAACCGCGATGGCCAGATCATCAAGCGTGTCTGTGGCAACGAGCGGCCAGCTCCTAT C CAGAGCATAG 
GAT CC TCACTCCACGTCCTC T TCCACTCCGATGGCTCCAAGAATTTT GACGGTT TCCATGC CATTTATGAGGAGATCACAGC 
ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 
jssfe TATAC TGGGCAGCGC T GTGAAAATCTCCT TGAAGAAAGAAAC TGCTC AGACC CTGGGGGCCCAGTCAATGGGTAC CAGAAAA 

Q TAACAGGGGGCCCTGGGCTTATCAACGGACGCCATGCTAAAATTGGCACCGTGGTGTCTTTCTTTTGTAACAACTCCTATGT 
lJ{ TCT TAGTGGCAATGAGAAAAGAAC TTGCCAGCAGAATGGAGAGT GGT CAGGGAAAC AGCCCATC TGCATAAAAGC CTGC CGA 

GAACCAAAGATTTCAGACCTGGTGAGAAGGAGAGTTCTTCCGATGCAGGTTCAGTC^ 
|j ACTCAGCGGCC T TCAGC AAGCAGAAACTGCAGAGTGCCCCTAC CAAGAAGCCAGCCC TTCCCT T TGGAGAT C T GCCCATGGG 

ATACCAAC ATC TGCATACC C AGC TCCAGT ATGAGTGCATC TCAC C CT TC TAC CGCCGCC TGGGCAGCAGCAGGAGGACATGT 
t i C TGAG GACTGGGAAGTGGAGT GGGCGGGC ACCATCCTGCATCC C TATC TGCGGGAAAATTGAGAACATCAC T GCTCCAAAGA 

0\ CCCAAGGGTTGCGCTGGCCGTGGCAGGCAGCCATCTACAGGAGGACCAGCGGGGTGCATGACGGCAGCCTACAC^AGGGAGC 

GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 
GTCACCATGATCAAGACAGCAGACCTGAAAGTTGTTTTGGGGAAATTCTACCGGGATGATGACCGGGATGAGAAGACCATCC 
fll AGAGC CTACAGATTTC TGC TATCATTCTGCATCCCAACTAT GACC CC ATCC TGCT TGATGC TGACATCGCCATCC TGAAGC T 

%!. CCTAGACAAGGCCCGTATCAGCACCCGAGTCCAGCCCATCTGCCTCGCTGCCAGTCGGGATCTCAGCACTTCCTTCCAGGAG 

TCC CACATC AC TGTGGC TGGC TGGAATGTCCTGGCAGACGTGAGGAGCCCTGGCTTCAAGAACGACACACTGCGCTCTGGGG 
? J J TGGTCAGTGTGGTGGAC TCGC TGC TGTGT GAGGAGCAGCATGAGGAC C ATGGC ATC C CAGTGAGTGTCAC T GATAACATGT T 

**J w CTGTGCCAGC TGGGAAC CC AC TGCCCCTT C TGATATC TGCACTGCAGAGACAGGAGGCATCGCGGCTGTGT C C TTCCCGGGA 

CGAGCATCT CC T GAGC CACGC TGGCATCTGATGGGACTGGTCAGCTGGAGC TATGATAAAACATGCAGCCACAGGCTC T C CA 
CTGCCTTCACCAAGGTGCTGCCTTTTAAAGACTGGATTGAAAGAAATATGAAATGRACCATGCTCATGCACTCCTTGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGCCAGGGCTT CTGAC TTC AGGGACAAAAC TCAGTGAAGGGTGAGTAGAC C TCCAT TGC TGGTAGGC TGAT GCCGCGTC C A 
C TACTAGGACAGCCAAT TGGAAGATGCCAGGGCTTGCAAGAAGTAAGT T TC T TCAAAGAAGAC CATATACAAAACCTC T C C A 
C TC CACT GACCTGGTGGTC T TCCCCAACT TTCAGTTATACGAAT GCC AT CAGC TTGACCAGGGAAGATCTGGGCTTCAT GAG 
GCCCCTT T T GAGGCTC TCAAGT TC TAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGCTGGGATGTGGTGCATGC C T T 
TGT GTACAT GGCCAC AGTAC AGTC TGGTCCTTTTCCTTCCCCAT C TC TTGT ACACAT TTTAATAAAATAAGGGTTGGC T T C T 
GAAC T ACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 38 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQILLE S YPLNAHCEWT I HAKPGFVT QLRFVMLS LE FDYMCQYDYVEVRDGDNRDGQI I 
KRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDGTCVLDKAGSYKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRTCQQNGEWSGKQPIC IKACREPKI SDLVRRRVLPMQVQSRET PLHQLYSAAFSKQKLQSAPTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 
KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVl^RTVWAAHCVTDLGKVTMIKTADL 
KWLGKFYRDDDRDEKT IQSLQI SAI I LHPNYDP I LLDADI AI LKLLDKAR I S TRVQP ICLAASR 
DLSTSFQESHITVAGWNVLADVRSPGFKNDTLRSGWSWDSLLCEEQHEDHGIPVSVTDNMFCA 
SWEPTAPSDICTAETGGIAAVSFPGRASPEPRWHMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 
Signal peptide : 

amino acids 1-23 

EGF-like domain cysteine pattern signature, 
amino acids 260-272 
N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 

Amidation site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 



GGTTCCTACATCCTCTCATCTGAGAATCAGAGAGCATAA.TCTTCTTACGGGCCCGTGATTTATTAACGTGGCTTAATC 
TGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAGAATGAAGGCGCTTCTGTTGC 
TGGTCTTGCCTTGGCTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTGTA 
AAGGT GCCTCC CACT ACGGCCT GAC CAAAGATAGGAAGAGGC GCTCACAAGAT GGCT GTCCAGACGGCT GT GCGAGCC 
TCAC^GCC^CGGCTCCCTCCCCAGAGGTTTCTGCAGCTGCCACCATCTCCTTAATGACAGACGAGCCTGGCCTAGACA 
ACCCTGCCTACGTGTCCTCGGCAGAGGACGGGCAGCCAGCAATCAGCCCAGTGGACTCTGGCCGGAGCAACCGAACTA 
GGGCACGGCCCT TTGAGAGAT CCACT ATTAGAAGCAGATCATT TAAAAAAATAAATCGAGCTTTGAGT GT T CTTC GAA 
GGACAAAGAGCGGGAGTGCAGTTGCCAACCATGCCGACCAGGGCAGGGAA^TTCTGAAAACACCACTGCCCCTGAAG 
TCTTT CCAAGGT TGT ACCACCT GATT CCAGAT GGTGAAATTAC CAGCATCAAGAT CAATCGAGTAGAT CCCAGTGAAA 
GCCTCTCTATTAGGCTGGTGGGAGGTAGCGAAACCCCACTGGTCCATATCATTATCCAACACATTTATCGTGATGGGG 
T GATC GCCAGAGACGGCCGGCTACTGCCAGGAGACAT CATT CT AAAGGTCAACGGGAT GGACAT C AGCAAT GT CCCT C 
y I ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 
■UJ GCAGGAACAAT GGACAGGCCCCG GAT GC CTACAGACC CCGAGAT GACAGCT TTCATGT GATTCT CAACAAAAGTAGCC 

Ul CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 
"'4. GTGTGGCATATCGACATGGTCAG CTT GAGGAGAATGACCGT GT GTTAGCCATCAATGGACATGAT CTT C GATATGGCA 

Q GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 
.8 GGAGCCCTGACATCTTTCAGGAAGCCGGCTGGAAC^ 

C I CTCCCAAGCCCCTCCATCCTACAATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCT CG 

; n 

W \ GCATGACCGTCGCAGGGGGAGCATCACATAGAGAAT GGGATTTGCCTATCTATGT CAT CAGTGTT GAGCCC GGAGGAG 

luzj TCATAAGCAGAGATGGAAGAATAAAAACAGGTGACATTTTGTTGAAT GTGGATGGGGT CGAACT GACAGAGGTCAGCC 

\H \ 

I U GGAGTGAGGCAGTGGCATTATT GAAAAGAACAT CAT CCTC GATAGTACTC AAAGCTTT GGAAGT CAAAGAGT ATGAGC 

CCCAGGAAGACT GCAGCAGCCCAGC^GCCCTGGACTCCAACCACAACATGGCCCCACCCAGTGACTGGTCCCCAT CCT 

; 

\U GGGTCATGTGGCTGGAATTAC CACGGTGCT TGT ATAACTGTAAAGAT ATT GTATT AC GAAGAAACACAGCT GGAAGTC 

T GGGCTTCT GCATTGTAGGAGGTTAT GAAGAAT ACAATGGAAACAAACCT TTTTT CAT CAAAT CCATT GTTGAAGGAA 
CACCAGCATACAATGATGGAAGAATTAGAT GT GGT GATAT T CTT CTT GCT GTCAAT GGTAGAAGTACAT CAGGAAT GA 
T ACAT GCTT GCT T GGCAAGACT GCT GAAAGAACTTAAAGGAAGAATTACT CTAACTATTGTTT CTTGGC CT GGCACTT 
T TTT ATAG ftAT CAAT GAT GGGT CAGAGGAAAACAGAAAAAT CACAAAT AGGCTAAGAAGTT GAAACACT ATATTTATC 
TTGTCAGTTTTTATATTTAAAGAAAGAATACATTGTAAAAATGTCAGGAAAAGTATGATCATCTAATGAAA 
ACACCTCAGAAAATATGATTCCAAAAAAATTAAAACTACTAGTTTTTTTTCAGTGTGGAGGATTTCTCATTACTCTAC 
AACATTGTTTATATTT TTTCTAT TCAATAAAAAGC CCTAAAACAACTAAAATGATTGATTT GTATACCCCACT GAAT T 
CAAGCTGATTTAAATT TAAAATT TG GTATAT GCT GAAGTCT GCCAAGGGTACATTAT GGC CATTTTTAATT TACAGCT 
AAAATATT TTTTAAAATGCATT GCTGAGAAACGTTGCTTT CAT CAAACAAGAATAAATATT TTTCAGAAGTT AAA 
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FIGURE AO 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRSFKKINR 
ALSVLRRTKSGSAVANHADQGRSNSENTTAPEVFPRLYHL I PDGEITS IKINRVDPSESLS IRLV 
GGSETPLVHI I IQHIYRDGVIARDGRLLPGDI ILKVNGMDI SNVPHNYAVR^LRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGVFIFNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKWNIQKDPGESLQ1TVAGGASHREWDLPIYVISVEPGGVISRDGR 
IKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHNMZIPP 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE 41 

ACCAGGCATTGTATCTTCAGTTGTCATC 

CTTGCCTGCAGTGAAGCAGAGAGATAGATATTATTCACGTAATAAAAAACATGGGCTTCAACCTGACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
ATAAGAC CCTCATTTT GGGGAAGGGAAAAACT C TGAC TAATGAAGCATC CACGAAGAAGGTAGAACTT 
GACAACT GTCCTTCTGTGTCT CCT TAC CT CAGAGGCC AGAG CAAG C T CAT T T T CAAACC AGAT CT CAC 
T TT GGAAGAGGTACAGGCAGAAAATCCCAAAGT GTCCAGAGGCCGGTAT CGCCCT CAGGAATGT AAAG 
CTT TACAGAGGGTCGC CATC CTCGTTCCCCAC C GGAACAGAGAGAAACACCTGATGTACCT GCTGGAA 
CATCTGCATCCCTTCCTGC^GAGGCAGC^GCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 
TAAAAAGTTTAATCGAGCCAAACT CTTGAATGT GGGC TATC TAGAAGCCCT CAAGGAAGAAAATT GGG 
ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 
CAT CCCAAGCATCTGGT GGT TGGCAGGAACAGCACTGGGTACAGGTTAC GTTACAGTGGATATT TTGG 
GGGTGT T ACTGCCCTAAGCAGAGAGCAGT TTTT CAAGGT GAAT GGATTC T CTAACAACTACTGGGGAT 
GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 
CCT GAAGTGGGTAAAT ATACAATGGTCTT CCACACTAGAGACAAAGGCAAT GAGGTGAAC GGAGAACG 
GATGAAGCTCT TACACCAAGTGTCACGAGTCTGGAGAACAGATGGGTTGAGTAGTTGTTCTTATAAAT 
TAGTATC TGTGGAACACAAT CCTT TATATATCAACAT CACAGTGGAT TTCT GGTT TGGTGCATGACCC 
T GGATCT TT TGGTGAT GTTT GGAAGAACT GATT CTTT GTT TGCAATAAT T T T GGC CTAGAGACT TCAA. 
ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT 
TAGCAGAGCTCCTGGT GAT GTAGAGTATAAAACAGT T GTAACAAGACAGCT TTCT TAGTC ATT T T GAT 
CAT GAGGGTTAAATAT T GTAATAT GGATACTTGAAGGACTTTATATAAAAGGATGACTCAAAGGATAA 
AATGAACGCTATTTGAGGACTCTGGTTGAAGGAGATTTATTTAAATTTGAAGTAATATATTATGGGAT 
AAAAGGC CACAGGAAATAAGACTGCTGAATGTC TGAGAGAACCAGAGTTGTT CTC GTCCAAGGTAGAA 
AGGTACGAAGATACAATACTGTTATTCATTTATCCTGTACAATCATCTGTGAAGTGGTGGTGTCAGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGACACAGTGAACTTGGGAATGAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTT CAGGGGAGGACCT GCCCAGGT ATGCCTTC CAGTGATGCCCACCAGAGAATACATTCTCTAT TAGT 
TTT TAAAGAGT TTTTGTAAAATGATTT TGTACAAGTAGGATAT GAATTAGCAGTTTACAAGTTTACAT 
ATT AACT AATAATAAATAT GTCTATCAAATACCTCTGTAGTAAAATGT GAAAAAGCAAAA 
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FIGURE 42 

MGFNLTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQEIPKAKEFMANFHKTLILGKGKTLTN 
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 
RNREKHL^LLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLOT FHDV 
DLVPENDBl^LYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSI^YWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
SVEHNPLYINITVDFOTFGA 



Important features: 
Signal peptide; 

amino acids 1-27 

N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 



U1 
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FIGURE 43 

GCTCAAGACCGAGCAGTGGGACAGC CAGACAGACGGCACGATGGCACTGAGCTCCCAGAT CT GGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGeCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAACT TGCAGAGC TGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTGGATGCCCAT 

GT T CCAGAGGCGAAGGAGGCGAGACACCCACT TCCC CATC TGCATT T TCT GCTGCGGC TGC T GT C 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACG TAGA ACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAAAAAAAAAAAAAAMiU\AAA^ 

AAAAAAAAAAAAA?^AAAAAAAAAAAAAAAA 
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FIGURE J.4 

MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
C I FCCGCCHRSKCGMCCKT 

Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 

amino acids 58-59 

N-myristoylation site. 

amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 
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FIGURE 4* 



GTGGCTT CATT T CAGTGGC TGAC TT CCAGAGAGCAATATGGCTGGT T CCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACCTTCAACACAACCCCTCTTGTCACCATACAGCCAGAAGGGGGCACTATCATAGTGACCCAAA 
ATCG TAATAGGGAGAGAGTAGACT TCC CAGATGGAGGCTAC TCCCTGAAGC TCAGCAAAC TGAA.G 
AAGAATGACTCAGGGATCTACTATGT GGGGATATACAGCT CATCAC T CCAGCAGCCCTCCAC CCA 
GGAG TACG TGCTGCATGTC TACGAGCACCTGT CAAAGCCTAAAGTCACCAT GGGTCTGCAGAGCA 
ATAAGAATGGCACCTGTGTGACCAATCTGACATGCTGCAT GGAACAT GGGGAAGAGGATGTGATT 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
CTGGAGATGGGGAGAAAGTGATATGAC CT TCATC TGCGT TGCCAGGAAC CC TGTCAGCAGAAAC T 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
!jj GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
iff GTT TC TGAAGAGAGAGAGACAAGAAGAGTACAT T GAAGAGAAGAAGAGAGT GGACATT TGTC GGG 

^1 ^CTCCTAACATATGCCCCCATTCTGGAGAGAAac^GAGTACGACACAATCCCT^OVCTAAT 
AGAACAATCCTAAAGGAAGATCCAGCAAATACGG TT TACT CCACTGT GGAAATACCGAAAAAGAT 
GGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTA 
TCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCA 



W 

m 



a 
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FIGURE 4.6 

MAGSPTCLTLIYILWQLTGSAMGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVTIQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC 
VKRNPVSRNFSSPILARKLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLWFLKRERQEEYIE 
EKKRVD I CRET PN I C PHS GENTE YDT I PHTNRT I LKE DPANT VYS T VE I PKKMENPHS LLTMPDT 
PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

Leucine zipper pattern ♦ 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, ■ 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE ah 



GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATT CAGCC TGCT GGT T CTACTGCTGTTAGGAGTAGT TCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGG T CT GATGGCCAT TCCAGCAACAACAAT GT CCT T GACAGCAAGAAAAAGAGC GT G 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAATGCCAAT T GTGAAT T TT CAT T GAAAAACATCAG TGACAT TCAT CCAGAATCCT T CAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
ACACCATGGCGAGTGGCTGGAGAGCATCTAGTTTCCACTTCGATTCTGAAGAAAACAAACATAGG 
% CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
0| CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
Ij.J T G TAGT T TAATGGGAATAAAATGTAAGTATCAGTAGT T TGAAAAAAAAAAA 

m 

a 
m 

Q 

nr 
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FIGURE A8 

MTCCEGWTSCNGFSLLVLLLLGWLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA 
TTMSLT7VRKRACC3SINRTGMFLSSFFSVITVIGALYCMLISIQALLKGPLMCNSPSNSNANCEFSL 
KNISDIHPESFNLQWFFNDSCAPPTGF3STKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 



AppJD=10063570 



Page 198 



FIGURE aq 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAACC TGCAGAG T CACACAGT TGC TGAC CAATT GAGC TGTGAGCCTGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAGCTGAGCGGACTGATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCACTCATCACTCCAGGCTCTGCCA 
CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAAGACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
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FIGURE so 



m 



\ u 



MERVTLALLLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCKYKS 
SQKQHSPVPEKAIPLITPGSATTC 

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 36-59 

N-myristoylation sites. 

amino acids 41-47, 45-51, 84-90 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE si 

GTGGACTCTGAGAAGCCCAGGCAGTTGAGGACAGGAGAGAGAAGGCTGCAGACCCAGAGGGAGGG 
AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 
AGGAGACCC TGGTGGGAGGAAGACACT C TGGAGAGAGAGGGGGC TGGGCAGAGATGAAGTTC CAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCAC T GGGACAAATATTGGGGAGGC CCT TGGACATGGCC T GGGAGACGC CCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 
TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 
GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 
GAAACTCAGCAGGCAGC T T TGGAATGAATCCTCAGGGAGCTCCC TGGGGTCAAGGAGGCAAT GGA 
GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 
AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 
ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 
AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 
TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 
CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 
CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 
CTTCAGAGGACAGGGAGTTTCCAGCAACATGAGGGAAATAAGCAAAGAGGGCAATCGCCTCCTTG 
GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 
GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 
GAATTTTAAATCCAAGCTGGGTTTCATCAACTGGGATGCCATAAACAAGGACCAGAGAAGCTCTC 
GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 
AACACCACCCTCTCATCACTAATCT GAGCCC T TGCCCT TGAAATAAACCT TAGC TGCCCCACAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 52 



MKFQGPIACLLI^CLGSGEAGPLQSGEESTGTNIGEALGHGLGDiU.SEGVGKAIGKEAGGAAGSKVS 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHE I GRQAEDVI RHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGI QG FRGQGVS SNMREI S KEGNRLLGGSGDNYRGQGS SWGSGGGDAVGGVNTVNSETS PGM 
FNFDT FWKNFKS KLGFINWDAI NKDQRS SRI P 

Signal peptide: 
amino acids 1-21 

N-glycosylation site. 

amino acids 265-269 

Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241 r 244-248,. 255-259, 324-328, 388-392 

Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 

N-rayristoylation site . 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE 53 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAAGGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
T TGCAC CCAAGGATAATCTCT TCAT CAGGT T CCT GAAGCCC TGGCTGGGAGAAGGGATACTGCTG 
M AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
£f GAAGTCCTATATAACGATCT TCAACAAGAG T GCAAACATCATGCT TGACAAG TGGCAGCACCTGG 

CCTCAGAGGGCAGCAGTCGTCTGGACATGTT TGAGCACATCAGCC TCAT GACCTTGGACAGTCTA 
u\ CAGAAATGCATCT T CAGCT T TGACAGC CAT TGTCAGGAGAGGCCCAGTGAATATAT TGCCAC CAT 

jjl CT T GGAGC TCAGTGCCC T TGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGAC T T TC TGT 

H ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
W GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 
%, AGCCAAGTCCAAGACT T TGGAT T TCAT TGATGTGCT T CTGCTGAGCAAGGATGAAGATGGGAAGG 

= ff CAT TGT C AGAT G AGGATAT AAGAGC AG AGG C T G ACAC C T T C ATGT T T G G AGG CC AT GAC AC C AC G 

□ GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
fll ACAGGAGGTGCAAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC 
Q. AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
1 U TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAA.GGCATTACCTG 

CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE 54 



MSLLSLPWLGLRPViMSPWLLLLLWGSWLLAM 

LITPTEEGLKDSTQMSATYSQGFTVWLGPIIPFIVLCHPDTIRSITNASAAIAPKDNLFIRFLKP 
WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 
S LMTLDS LQKCI FSFDSHCQERPSE YIAT I LELSALVEKRSQH ILQHMDFL YYL S HDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 
LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVWPDPEVYDPFRFDPENSKGRSP 

LAFIPFSAGPRNCIGQAFAMAEMKWLALMLLHFRFLPDHTEPRRKLELIM^^ 
VGLQ 

Important features: 
■ f a Transmembrane domains : 

■■.3? 'i 

%l amino acids 13-32 (type II), 77-102 

:W ' 

Cytochrome P450 cysteine heme-iron ligand signature. 

TX amino acids 461-471 

yi 

- 

fJJ N-glycosylation sites. 

P amino acids 112-116, 168-172 
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FIGURE 55 

ATCGCATCAATTGGGAGTACCATCTTCCTCATGGGACCAGTGAAACAGCTGAAGCGAATGTTTGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAAT T CATGGCCAGT T T TATGAAGCT TTGGAAGGCAC TATGGACAGAAGC TGGTGGACAGTT TT 

GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GTGATTCGAACATTTGAGGGTTACTTTTGGAAGCAACAATACATTCTCGAACCTGAATGTCAGTA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
T CT GGATGT TG T CC CAC TGAAT TCC CATGAAT ACAAACCT AT TCAGC AAGAGCAAAAAAAAAAAA 
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FIGURE *6 



yi 



.fas;? 



fy 



MGPVKQLKRMFEPTRLIATIMVLLCFALTLCSAFWWHNKGLALIFCILQSLALTWYSLSFIPFAR 
DAVKKCFAVCLA 

Important features: 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen -binding domain protein. 

amino acids 30-72 
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FIGURE 57 



CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAAC ATGC TCTGTCTGTGCCTG 
TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
ATTTT CAAGCTCAGT GTCTTCAT CC CCT CCCAGGAAT TCT CCACCTAC CGCCAGT GGAAGCAGAAAATT GTACAAGCT 
GGAGATAAGGAC CTT GATGGGCAGCTAGACTTT GAAGAATTTGTCCATTAT CT CCAAGATCAT GAGAAGAAGCTGAGG 
CTGGTGTTTAAGATTTTGGACAAAAAGAATGATGGACGCATTGACGCGCAGGAGATCATGCAGTCCCTGCGGGACTTG 
GGAGT CAAGATAT CT GAACAGCAGGCAGAAAAAATT CTCAAGAGCAT GGAT AAAAACGGCAC GATGACCAT CGACT GG 
AACGAGTGGAGAGACTACCACCT CCT CCAC CCCGTGGAAAACATCC C CGAGAT CAT C CTCT ACTGGAAGCAT TCCACG 
AT CTTTGAT GTGGGT GAGAATCT AACGGTCCCGGAT GAGTTCACAGT GGAGGAGAGGCAGAC GGGGAT GTG GTGGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAGGT CCAT GCCT CCCGCAGCAACAACATGGGCATC GTTGGT GGCTT CACT CAGAT GATTC GAGAAGGAGGGGCCAGG 
TCACT CTGGCGG GGCAATGGCAT CAACGTCCT CAAAATTGC C CCCGAATC AGC CAT CAAATT CAT GGC CTAT GAGCAG 
ATCAAGCGCCTTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCAGGGGCC 
Lu AT CGCCCAGAGCAGCATCTACC CAATGGAGGT CCTGAAGAC CCGGAT GGC GCT GCGGAAGACAGGCCAGTACT CAGGA 

\* AT GCT GGACTGCGCCAGGAGGAT CCT GGCCAGAGAGGGGGT GGCCGCCTT CTACAAAGGCTATGT CCCCAACATGCTG 

GGCATCAT CCCCTATGCCGGCATCGACCTT GCAGTCTACGAGACGCT CAAGAATGCCTGGCTGCAGCACTATGCAGTG 
AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
k*J CC CCT GGCC CTAGTCAGGACCC GGAT GCAGGCGC AAGCCT CTATTGAGGGC GCT CCGGAGGT GACCAT GAGCAGCCTC 

0 1 TT CAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGT CATCCCA 

i i GCTGT GAGCATCAGCTACGTGGT CTACGAGAAC CTGAAGAT CACCCT GGGCGTGCAGT CGCGGTGACGGGGGGAGGGC 

4f CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
Ijj AAGCT GTCT CGAG CCAAGCT GT GAAAACCC TAGACGCACCCGCAGGGAGGGTGGGGAGAGCT GGCAGGCCCAGGGCTT 

% a I GT CCT GCT GACCCCAGCAGACCCTCCTGTT GGTT CCAG CGAAGACCACAGGCATT CCTTAGGGT CCAGGGT CAGCAGG 

CT CCGGGCTCACATGT GTAAGGACAGGACATTTT CT G CAGT GC CTGC CAATAGTGAGCTTGGAGCCTG GAGGCCGGCT 
TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
ATAAT CCAT GAT GAAAGGT GAGGTCACGT GGCCTCCCAGGCCT GACTT CC CAACCTACAGCATTGACGCCAACTT GGC 
IJ TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
T\\ CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
to* GGCCTGGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTGTCCCCACTGTGGCATGAGGGCAGTGGAGCA 
2| CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG 
f\ | - ACTGTTGGGAAAAGGGT TTT GT C CAGAAGGACAAGCC GGACAAAT GAGCGACTT CTGT GCT T CCAGAG GAAGACGAGG 

•Jf GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
O GGGAC CAGC CCCACATT CC ACTT GTGT CACT GCTT GGAAC CTATT TAT TTT GTATTTATTT GAACAGAGTT ATGT CCT 

AACTATTTTTATAGATT TGTTT AATTAAT AGCTTGT CATTTTCAAGT T CATT TTT TATTCAT ATTTAT GTT CATGGTT 
GATTGTAC CT TCCCAAG CCCGCC CAGTGGGAT GGGAGGAGGAGGAGAAGGGGGGCCTT GGGC CGCTGCAGT CACAT CT 
GTCC^GAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGACTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCT GCCCTTGCTTAACAATGCCGGCCAACTGGCGACCT CACGGTTGCACTTCCATT CCACCAGAATGACCTGA 
TGAGGAAAT CTTCAATAGGATGCAAAGAT CAAT GCAAAAATT GTTAT ATAT GAACATATAACTGGAGT CGTCAAAAAG 
CAAATTAAGAAAGAATTGGACGTTAGAAGTTGT CATTTAAAGCAGC CT TCTAATAAAGTTGTTT CAAAGCT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



III 
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FIGURE *8 



MLCLCLYVPVI GEAQTEFQYFESKGLPAELKS I FKLSVFIPSQE FS T YRQWKQKI VQAGDKDLDG 
QLDFESFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILKSMDKNG 
TMT I DWNEWRDYHLLHPVENI PE I ILYWKHS T I FDVGENLTVPDEFTVEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVIJIQVHASRSISimGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIK 
FMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQ 

ILAREGVAAFYKGYVPNMLGI I PYAGIDLAVYETLKNAWLQHYAVNSADPGVFVLLACGTMSS TC 
GQLAS YPLAL VRTRMQAQAS I EGAPEVTMS SL FKE I LRTE GAFGLYRGLAPNFMKVI PAVS I S YV 
VYENLKI TLGVQSR 

Important features: 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites . 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE so 

GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATOGC 
T TC CCTGGGGCAGATCC TC TTC T GGAGCATAAT TAGCATCAT CAT TAT TC T GGC TGGAGCAAT T G 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAACAT TGGGGAGGATGGAATCCTGAGC TGCACT T TT GAACCTGACAT CAAACTT TCTGATAT 
C G TGATACAATGGC TGAAGGAAGGT GTTT TAGGCT T GGT CCATGAGT TCAAAGAAGGCAAAGAT G 
AGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTTGCTGATCAAGTGATAGTT 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
CATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATAAAACTGGAGCCTTCAGCATGCCGG 
% AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
gfi CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 
Uj CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 
f j T CAACAACACATACTCC TGTATGAT TGAAAATGACATTGC CAAAGCAACAGGGGATAT CAAAGT G 

ACAGAATCGGAGATCAAAAGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGT 
Z" CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 
Q GTGCCTTGGCCACAAAAAAGCATGCAAAGTCATTGTTACAACAGGGATCTACAGAACTATT TCAC 

Ul C ACCAGATATGACC TAG T T T TATATTTCTGGGAGGAAATGAATTCAT AT CT AGAAGTC TGGAGT G 

Ijj AGCAAACAAGAGCAAGAAACAAAAAGAAGGCAAAAGCAGAAGGCTCCAATATGAACAAGATAAAT 
|4 ; C TATCT T CAAAGACAT AT TAGAAGT TGGGAAAATAAT T CATGT GAAC TAGACAAGTGTGT TAAGA 

i"fi GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTGTTTGTCTCTGAATTTTTAGTTATATGTGC 
TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TC TT CCCAAC TGACAAATGCCAAAGT TGAGAAAAAT GATCATAAT TT TAGCATAAACAGAGCAGT 
CGGGGACACCGAT T T TATAAATAAACT GAGCACCT T CT T T T TAAACAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 



MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 

D I VI QWLKEGVLGLVHE FKEGKDELSEQDEMFRGRTAVFADQVI VGNASLRLKNVQLTDAGT YKC 

YIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCE^RWFPQPTVVWASQVDQGANFSEVS 

NTSFELNSEWTMKVVSVLYOTTITO^ 

CVSSFFAISWALLPLSPYLMLK 

Important features: 
Signal peptide: 

amino acids 1-28 



N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 



Transmembrane domain: 



amino acids 258-281 



220-224 




N-myristoylation sites. 

amino acids 52-58, 126-132, 188-194 
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FIGURE 61 



TGACGTCAGi^TCACCATGGCCAGCTATCCTTACCGGCAGGGCTGCCCAGGAGCTGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGTGGACTCAGATCACAGTGGCTATATCTCCATGAAGGAGCTAAAGCAGGCCCTGGTCAACTGCA 
m ATT GGTCT TCAT TCAATGATGAGAC CT GCCTCATGATGAT AAACATGT T TGACAAGACCAAGTCA 

Q GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
CP CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAM 
k H AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 

m 

r: GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 

'^j . 

f-t GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGACTTCGTCA 
_g- CCATGACAGCTTCTCGGATGCTATGACCCAACCATCTGTGGAGAGTGGAGTGCACCAGGGACCTT 
C.J T CCTGGCT TC T TAGAGTGAGAGAAGTATGTGGACAT CTCT TCT TTTCCT G T CCC TC TAGAAGAAC 

X jl ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
jS'j AATAGTGAGGAeCGGGGCTGAGGCCACACAGA^ 

n ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
f|| AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCAT GAGAGAGATCT G T CTATGGGACCAGT GGCT T GGAT TC TGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAG TC TCCAGGAGACAGTGGTCAC C T C TCCC TGCCAATACT T T TT T TAATT TGCAT TT T T T TT C 
ATT TGGGGCCAAAAG T CCAGTGAAAT T GTAAGCTTCAATAAAAGGAT GAAACTC TGA 
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FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGYI SMKELKQALVNCNWS S FNDE T CLMM INMFDKTKS GR I DVYG FS AL WKFI QQWKNL FQQYDR 

DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGN IRLS FEDFVTMTASRML 

Important features of the protein; 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 63 

CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCG&GGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
^ TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
r""; TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
fll CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
ill TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 
f I ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 
^ C AGC TGGGGAAATAT TAATCGGT AT CCAGGAGGCAGCT GGGGGAATAT T AATCGGTATCC AGGAG 

s GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 
£J CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
Ul GGGCTA^GCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 
y TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
y% TGCT CATC TCCAAT AAAATAAAAGCACT TATGARAAAAAAAAAA^ 

|-| | AAAAAAAAAAAA&AAAAAAAAAAAAAAAAAAAAA^^ 
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FIGURE 6d 

MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSTCFPPAGGSAVQRWPPSWGLPAMDSWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-159 

N-myristoylation sites. 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 



m 
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FIGURE 6 s 

AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 66 



MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARSQHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 

H N-glycosylation site. 

m 

r TL ammo acids 37-41 

m N-myristoylation sites. 

Ul amino acids 15-21, 19-25, 60-66 

?* ( 

H 

'vs. 

V*V'\ 
) s | J 

f|.E 

111 ■ 
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FIGURE 67 

ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
AGACT CT GCCCC TGAGGTCAAGAGAGGATGGGGC TAT TCACT T T TATATAT T TATATAAAAT TAG 
TAGTGAGAT GTAAAAAAAAAAAAAAAAAA 
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FIGURE 68 

MANPGLGLLLALGLPFLLARWGRAWGQIQT TSANENSTVLPS S TSS S S DGNLRPEAI TAI I WFS 
LLAALLIAVGLALLVRKLREKRQTEGTYRPSSEEQFSHAAEi\IU^QDSKETVQGCLPI 

Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 36-40 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 86-90 

Tyrosine kinase phosphorylation site. 

amino acids 86-94 

N-myristoylation sites, 
amino acids 7-13, 26-32 
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FIGURE 6q 



GC CAG GAATAACT AGAGAGGAACAATGGGGTTATT CAGAGGTTTT GTTTT CCT CTTAGTT CTGT GCCT GCT GCAC CAG 
TCAAATACTT CCTTCATTAAGCT GAATAATAATGGCTTT GAAGATATTGT CATTGTT ATAGATCCTAGT GT GCCAGAA 
GATGAAAAAATAATT GAACAAATAGAGGATAT GGT GACTACAGCTT CTACGTACCT GT TT GAAGCCACAGAAAAAAGA 
TTT TTT TTCAAAAAT GTATCTATATTAATT CCTGAGAATTGGAAGGAAAATCCTCAGTACAAAAGGCCAAAACAT GAA 
AACCATAAACATGCTGATGTT ATAGTT GCACCACCTACACTCCCAGGTAGAGAT GAACCATACACCAAG CAGT T CACA 
GAATGT GGAGAGAAAGGCGAATACATT CACTT CAC CCCTGACCTT CTACTTGGAAAAAAACAAAATGAATAT GGACCA 
CCAGGCAAACTGTTTGTCCATGAGTGGGCTCACCTCCGGTGGGGAGTGTTTGATGAGTACAATGAAGATCAGCCTTTC 
TACCGT GCTAAGT CAAAAAAAAT C GAAGCAACAAGGTGTT CCGCAGGTAT CTCTGGTAGAAATAGAGTT TATAAGTGT 
CAAGGAGGCAGCTGTCTTAGTAGAGCATGCAGAATTGATTCTACAAC^AAACTGTATGGAAAAGATTGTCAATTCTTT 
CCTGATAAAGTACAAACAGAAAAAGCATCCATAAT GTTTATGCAAAGTATTGATTCTGTT GTTGAATTTTGTAACGAA 
AAAAC CCATAATCAAGAAG CT CCAAGCCTACAAAACATAAAGTGCAATTTTAGAAGTACAT GGGAGGT GAT TAGCAAT 
TCTGAGGATT TTAAAAACACCAT ACCCATGGTGACACCACCT CCT CCACCT GT CTT CTCATTGCTGAAGAT CAGT CAA 
Ui; AGAATT GTGT GCTTAGT TCT T GATAAGT CTGGAAGCAT GGGGGGTAAGGACCGCCT AAAT C GAATGAATCAAGCAG CA 

AAACATTTCCTGCTGCAGACTGTTGAAAATGGATCCTGGGTGGGGATGGTTCACTTTGATAGTACTGCCACTATTGTA 
M AATAAGCTAATCCAAATAAAAAGCAGT GATGAAAGAAACACACTCATGGCAGGATTACCTACATAT CCTCT GGGAGGA 

Q ACTTCCATCT GCT CTGGAATTAAATAT GCATTT CAGGT GATT GGAGAGCTACATTCCCAACTCGATGGATC C GAAGTA 

Zl f CT GCT GCTGACTGATGGGGAGGATAACACTGCAAGTTCTTGTATT GAT GAAGT GAAACAAAGTGGGGCCATT GTT CAT 

^ I TTTATTGCTTTGGGAAGAGCTGCTGAT GAAGCAGTAATAGAGATGAGCAAGATAACAGGAGGAAGTCATTTTTAT GTT 

Uj TCAGATGAAGCTCAGAACAATGGCCTCATTGATGCTTTTGGGGCTCTTACATCAGGAAATACTGATCTCTCCCAGAAG 
fi ij TCCCT T CAGCT CGAAAGTAAGGGAT TAACACT GAATAGTAAT GCCT GGAT GAACGACACT GTCATAATTGATAGTACA 

GTGGGAAAGGACACGTTCTTTCTCATCACATGGAACAGTCTGCCTCCCAGTATTTCTCTCTGGGATCCCAGTGGAACA 
% I ATAAT GGAAAATTTCACAGT GGATGCAACT T CCAAAAT GGCCTATCTCAGTAT TCCAGGAACTGCAAAGGT GGGCACT 

f*\ TGGGCATACAATCTTCAAGCCAAAGC GAACCCAGAAACATTAACTATTACAGTAACTTCT C GAGCAGCAAAT T CT TCT 

$ GT GCCT CCAAT CACAGTGAAT GCTAAAATGAATAAGGACGTAAACAGTTT CCCCAGCCCAATGATTGT TTACGCAGAA 

g AT T CTACAAGGAT ATGTACCT GTT CTT GGAGC CAATGT GACT GCTTTCAT T GAATCACAGAAT GGACATAC AGAAGTT 

ft* TT GGAACTTTT GGATAAT GGT GCAGGCGCT GATT CTTT CAAGAATGATGGAGT CTACTCC AGGTATT TTAC AGCATAT 

fe*-f ACAGAAAAT GGCAGAT ATAGCTTAAAAGTT C GGGCTCAT GGAGGAGCAAACACTGC CAGGCTAAAATTACGGCC T CCA 

If] CT GAATAGAGCC GCGT ACATACCAGGCT GG GT AGT GAACGGGGAAAT TGAAGCAAACCCGCCAAGACCT GAAATT GAT 

GAGGATACTCAGACCACCTT GGAGGAT TTCAGCCGAACAGCATCCG GAGGT GCATT T GTGGTATCACAAGT CCCAAGC 
CTTCC CTTGCCT GACC AATAC CCACCAAGTCAAAT CACAGACCTT GATGC CACAGT T CAT GAGGATAAGATT ATT CTT 
p| I ACATGGACAGCACCAGGAGAT AATTTT GATGT T GGAAAAGTT CAACGTTATAT CAT AAGAATAAGTGCAAGT ATT CTT 

*H GAT CT AAGAGACAGTTTTGAT GATGCT CTTCAAGT AAAT ACT ACT GATCT GTCACCAAAGGAGGCCAACT CCAAGGAA 

AGCTTTGCAT TTAAAC CAGAAAATATCT CAGAAGAAAAT GCAACC CACAT ATTTAT T GCCATTAAAAGTAT AGAT AAA 
ffj AGCAATTTGACATCAAAAGTATCCAACATTGCACAAGTAACTTTGTTT^^ 

CCTACACCTACTC CTACT CCT ACTCCTACTCCTGATAAAAGT CATAATTCTGGAGT TAATAT TT CTACGCT GGTATT G 
TCT GT GATTGGGT CTGTTGTAATT GTTAACTTTAT TTTAAGTACCACCATT TGAAC CTTAACGAAGAkAAAAAT CTT C 
AAGTAGACCTAGAAGAGAGTT TT AAAAAACAAAACAAT GTAAGTAAAGGATAT TTCT GAAT CTTAAAATTCATCCCAT 
GT GTGAT CATAAACTCATAAAAAT AATTTTAAGAT GT C GGAAAAGGATACTTT GATT AAATAAAAACACTCATGGATA 
TGTAAAAACT GTCAAGAT TAAAATTTAATAGTTTC ATTTATT TGTTATTT TAT TTGTAAGAAATAGT GATGAACAAAG 
AT CCTTT TT CATACTGATACCT GGTT GT ATATTATTTGATGCAACAGTTT T CT GAAATGATATTTCAAATT GCAT CAA 
GAAAT T AAAAT CATCTATCT GAGTAGT CAAAATACAAGTAAAGGAGAGCAAATAAACAACATTT GGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 7Q 



MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKASIMFMQSIDSWEFCNEKTHNQEAP 
S LQNIKCNFRS TWE VI SNSE D FKNT I PMVT P PP PPVFSLLKI SQR I VCLVLDKS GSMGGKDRLNR 
MNQAAKHFLLOTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFL 
I TWNSLPPS I SLWDPSGT IMENFTVDATSKMAYLS I PGTAKVGTWAYNLQAKANPETLTITVTSR 
m AANS SVP P I TVNAKMNKDVNS FPS PM I VYAE I LQG YVPVLGANVTAF IE SQNGHTEVLEL LDNGA 

Ly GADS FKNDGVYSRYFTAYTENGRYS LKVRAHGGANTARLKLRPPLNRAAY I PGWWNGE I EANPP 

■A RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
^- FDVGKVQRYI I RI SAS I LDLRDS FDDALQVNT TDL S PKEANSKES FAFKPENI S EENATH I F IAI 

KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 

>*t VNFILSTTI 

? || . 

■ M • Signal peptide : 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 



rsi 



Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



CT CCT TAGGTGGAAACCCTGGGAGTAGAGTACTGACAGC AAAGAC CGGGAAAGAC CATACGTCCCC GGGCAGGGGTGA 
CMCAGGTGTC7VTCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGACGCCAAGGTAATTTTGACCCA 
GAGGAGCAAT GAT GTAGCCACCT CCTAACCTT CCCTT CTTGAACCC CCAGT TATGC CAGGAT TTACTAGAGAGTGTCA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GT GCT CAGAAGTGACTT GCCTGAGGGT GGACCAGAAGAAAGGAAAGGT CCCCT CTT GCTGTT GGCTGCACATCAGGAA 
GGCTGTGAT GGGAATGAAG GT GAAAACT TGGAGATT TCACTT CAGT CATT GCT TCT GCCT GCAAGAT CATC CTTTAAA 
AGTAGAGAAGCT GCTCT GTGT GGT GGT TAACT CCAAGAGGCAGAACTC GTTCTAGAAGGAAAT GGAT GCAAGCAGCTC 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTT GCTGGTGCTCCT CTGCTGT GCTATCT CT GT CCT GTACATGTTGGCCTGCACC CCAAAAGGT GACGAGGAGCAG 
CT GGCACTGCCCAGGGCCAACAGCCCCACG GGGAAGGAGG GGTACCAGGCCGT CCTT CAG GAGTGGGAGGAGCAGCAC 
CGGAACTACGTGAGC&GCCTGAAGCGGCAGATCGCACAGCTCAAGGAGG^ 

aatgggcagtaccaagccagcgatgctgctggcctgggtctggac^ggagccccccagagaaaacccaggccgacctc 

ctggccttcctgcactcgcaggtggacaaggcagaggtgaatgctggcgtcaagctggccacagagtatgcagcagtg 
W cctttcgatagctttactctac^gaaggtgtaccagctggagactggccttacccgccaccccgaggagaagcctgtg 
O aggaaggacaagcgggatgagttggtggaagccattgaatcagccttggagaccct gaacaatcctgcagagaacagc 

*li cccaatcaccgtccttacacggcctctgatttcatagaagggatctaccgaacagaaagggacaaagggacattgtat 

gagctc^ccttcaaaggggaccao^c^cgaattc^^ 
y| gtgaaaaatgaaaagctcaacatggccaacacgcttatcaatgttatcgtgcctctagcaaaaagggtggacaag^ 
i « cggcagttcatgcagaattt cagggagatgtgcattgagcaggat gggagagt ccatctcactgttgtttacttt ggg 

& aaagaagaaataaatgaagtcaaaggaatacttgaaaacacttcc^w^gctgccaacttcaggaactttaccttcatc 
y cagctgaatggagaattttctcggggaaagggacttgatgttggagcccgcttctggaagggaagcaacgtccttctc 
^5 tttttctgtgatgtggacatctacttcacatctgaattcctcaatacgtgtaggctgaatacacagccagggaagaag 
M gtattttatccagttcttttcagt cagtacaatcctggcataatatacggccaccatgatgcagtccctcccttggaa 

g cagc^gctggtcataaagaaggaaactggattttggagagactttggatttgggatgacgtgtcagtatcggtcagac 

ttcatcaatataggtgggtttgatctggacatcaaaggctggggcggagaggatgtgcacctttatcgcaagtatctc 
** J cacagcaacctcatagtggtacggacgcctgtgcgaggactcttccacct ctggcat gagaagcgctgcatggacgag 

tfl ctgacccccgagcagtacaagatgtgcatgcagtccaaggccmgaacgagg 

^ GTGTTCAGGCACGAGATAGAGGCTCACCTTCGCAAAraGA^ 

'f * *" GAAGGATTGTGGGAGACACTTTTTCTTTCCTTTT GCAATTACTGAAAGT GGCT GCAACAGAGAAAAGACTTCCATAAA 

fU GGACGACAAAAGAATT GGACTGAT GGGT CAGAGAT GAGAAAGCCT CCGAT TTCT CTCT GTTGGGCTTT TTAC AACAGA 

! f*\ AATCAAAAT CTC CGCTT TGC CTGCAAAAGTAACCCAGT T G CACCCT GTGAAGT GT CT GACAAAGGCAGAAT GCTT GTG 

5 »f AGATTATAAGCCTAATGGTGTGGAGGTT TTGATGGTGT TTACAATACACT GAGAC CT GTT GTTTT GT GTGCT CATTGA 

fU AATATT CAT GATT TAAGAGCAGTTT TGTAAAAAATTCATTAGCAT GAAAGGCAAGCATATTT CT CCT CATAT GAATGA 

GCCTATGAGCAGGGCTCTAGTTTCTAGGAAT 

AGTGAGTACATTAAGTAAAATAAAATGGACCAGAAAAGAAAAGAAACCATAAAT^^ 

TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TTTTTTCCCTTGT GAGTTATAGTCTGCTTATTTAATTACCACTTT GCAAGCCT TACAAGAGAGC ACAAGTT GGCCTAC 
ATTTTTATAT TTTTTAAGAAGATACTTT GAGATG CATTAT GAGAACTTT CAGT T CAAAGCATCAAATT GATGCCATAT 
CCAAGGACAT GCCAAAT GCT GATT CTGTCAGGCACTGAAT GT CAGGCATT GAGACATAG GGAAGGAAT GGTTTGTACT 
AATACAGACGTACAGATACTTTCT CTGAAGAGTATTTT CGAAGAGGAG CAACT GAACACT GGAGGAAAAGAAAAT GAC 
ACT T T CT GCTTTACAGAAAAGGAAACT CATT CAGACTG GTGATAT CGT GATGTACCTAAAAGT CAGAAAC CACAT TTT 
CTCCTCAGAAGTAGGGACCGCTTTCTTACCTGTTTAAATAAACCIAAAGTATACCGTGTGAACC7VAACAATCTCTTTTC 
AAAACAGGGT GCT CCTCCT GGCTT CTGGCTTC CATAAGAAGAAAT GGAGAAAAATATATAT ATATATATATATAT TGT 
GAAAGATCAATCCATCT GCCAGAATCTAGT GGGAT GGAAGTTTTT GCTACATGT TAT CCACCCCAGGCCAGGTGGAAG 
TAACT GAAT TATTTTTTAAATTAAGCAGTT CTACT CAAT CACCAAGAT GCTTCT GAAAATT GCATTTTATTACCATTT 
CAAACTATTTTTTAAAAATAAATACAGTTAACATAGAGTGGTTTCTT 

AT GCAT GAGCTAATTAT CTCTTTGAGT CCTTGCTT CT GTTTGCTCACAGTAAACT CAT TGTTTAAAAGCTT CAAGAAC 
AT TCAAGCT GTTGGTGT GTT AAAAAAT GCATT GTATT GATTT GTACT GGTAGTT TAT GAAAT TT AATTAAAACACAGG 
CCATGAATGGAAGGTGGTATT GCACAGCTAAT AAAATAT GAT TTGT GGAT ATGAA 
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FIGURE 72 

MMMVRRGLLAWI SRWVLL^^ 

EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 
ENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKL3SIMAN 
TLIWIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTVVYFGKEEINEVKGILENTSKAANFRNF 
TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 
KYLHSNLIVVRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHL 
RKQKQKT S SKKT 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites, 

amino acids 315-319, 324-328 

N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218/ 311-317, 339-345, 393-399 

Amidation site* 

amino acids 377-381 
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FIGURE 73 

GAGACTGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGATTTGTCCTGGGGATCCA 
GAAACCCAT GAT ACCC TACT GAACAC CGAATCCCC TGGAAGCC CACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAATGCCCAGTCGCCCATCGATATTCAGACAGACAGTGTGACATTTGACCCTGATTTGCC 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 
TGAAGCC AC AT T TGCAGAGCTCCACAT T G TACATTAT GAC T C T GAT TCCTATGACAGC T TGAG T G 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
AATATAGCTTATGAACACATTCTGAGTCACTTGCATGAAGTCAGGCATAAAGATCAGAAGACCTC 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
GCT CGCTCACAACT CCC CC T T GC T AC CAGAGT G TGCT CT GGACAGT T T TT TATAGAAGG T CCCAG 
ATTTCAATGGAACAGCTGGA7\AAGCTTCAGGGGACATTGTTCTCCACAGAAGAGGAGCCCTCTAA 
GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
C CGAAAGAGTGTGG TC T TCACCT CAGCACAAGC CACGACTGAGGC ATAAA T TCCT TCT CAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTGTAGGAGTAGTAAGCAGATGTCCTCCTTCCCCTGGACATCTCTTAGAGAGGAAT 
GGACCCAGGCTGTCATTCCAGGAAGAACTGCAGAGCCTTCAGCCTCTCCAAACATGTAGGAGGAA 
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
C T CCTTAGGATAAAGAGTTGCT GT TGAAG T TGTATAT T T TT GATCAATATAT T T GGAAATTAAAG 
TTTCTGACTTT 
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FIGURE 74 

MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 
WFTSAQATTEA 

Important features of the protein: 
Signal peptide: 
amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryotic-type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 7fi 

TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCTGGTGGTTTGCCTjWVCCTGCAAACATCACCTTCTTATCCATCAACATGAAGA^TGTCCTACA 
ATGGACT CCACCAGAGGG TCT T CAAGGAG TTAAAG T TACT TACACTGTGCAG TAT TTCATCACAA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AGAGAAGTGGAAGAGAAATCCAGAAGACC T TCC TGT TTCCATGCAACAAATATACT CCAAT CTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
TCTGTGATGGGCTATTCCATCTACCGATATATCCACGTTGGCAAAGAGAAACACCCAGCAAATTT 
GATTTTGATTTATGGAAATGAATTTGACAAAAGATTCTTTGTGCCTGCTGAAAAAATCGTGATTA 
ACTTTATCACCCTCAATATCTCGGATGATTCTAAAATTTCTCATCAGGATATGAGTTTACTGGGA 
AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 
GGAAGAGGAGGTGAAACATT TAGGG TAT GCT TCGCAT TT GAT GGAAATT T T T TGTGAC TCTGAAG 
AAAACACGGAAGGTACT TCTCT CACCCAGCAAGAGTC CC T CAGCAGAACAATACCCCC GGATAAA 
ACAGT CAT T GAATATGAATATGAT GT CAGAACCAC TGACATT T G TGCGGGGCC TGAAGAGCAGGA 
GCT CAGT TT GCAGGAGGAGGTGTC CACACAAGGAACATTAT T GGAG TCGCAGGCAGCGT T GGCAG 
TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 
TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
CCAGACAGGCCACCAGGAGAAAAT GAAACCTATCTCATGCAAT TCATGGAGGAATGGGGGT TATA 
TGTGCAGATGGAAAAC TGA TGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 
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FIGURE v6 

MS YNGLHQRVFKELKLLTLCS I S S Q I GPPEVALTTDEKS I SWLTAPEKWKRNPEDLPVSMQQI Y 
SNLKYNVSVLNTKSNRTWSQCV^^ 

DQS SE FKAKI I FW YVLP I S I TVFLFSVMG YS I YR YIHVGKEKHPANL I L I YGNE FDKRFFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 



GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAG ATGG CCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGC T GAAGGACCACAAC GC CACCAGCATCC TGCAGC AGC TGCC GC TGCTCAG T GC CATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
I TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 
( CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
% TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
j CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
I CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
\ TCCTGCCATCAAGGGTGACACCAT T CAGC TCTACC TGGGGGCCAAG T TGT TGGACTCACAGGGAA 

AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAAGATCCCG 
| TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
] ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
I TGATCAATGAAAAGGCTGCAGATAAGCTGGGATCTACCCAGATCGTGAAGATCCTAACTCAGGAC 
\ ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
: TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
T T TACAC CAAAGGT GACCAAC T TATACTCAAC T TGAATAACAT CAGC TC TGATCGGAT CCAGC TG 
ATGAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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FIGURE 78 

MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGGI PVLGS LVNTVLKH 1 1 WLKYI TANI LQLQVKPSANDQELLVKIPLDMVAGFNTPLVKT I VE 
FHMTTEAQAT I RMDTS AS GPTRLVLS DCATSHGSLRI QLLYKLS FLVNALAKQVMNLLVPS LPNL 
VKNQLCPVIEAS FNGMYADLLQLVKVPISLSIDRLE FDLLYPAIKGDTIQLYLGAKLLDSQGKVT 
KWFNNSAASLTMPTLDNIPFSLIVSQDVVKAAV^ 

EKAADKLGSTQIVKILTQDTPEFFIDQGHAPCVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQLI LNLNN I SSDRI QLMNSGI GWFQPDVLKN 1 1 TE 1 1 HS I LLPNQNGKLRSGVPVSLVKALG 
H* FE AAE S S L T KDALVL T PA S L WKP S S P VS Q 

Q 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 

5 - 

C| Glycosaminoglycan attachment site. 

Ill amino acids 412-415 

n 

''1,1 

LBP / BPI / CETP family proteins. 

amino acids 407-457 



Cfl 
ill 

in 
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FIGURE 7Q 

GAGAGAAGTCA.GCCTGGCAGAGAGACTCTGAAATGAGGGATTAGAGGTGTTCAAGGAGCAAGAGC 
TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCATCACCCAGTGTGACATCTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
% ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
?« CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 

y * 

y| CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 

Lfl CAAGCCCAACC TCT TGCCAC AAGGAGCTC TCCAAGGCC T GGT CAACC TCCCAAAG T CAAGAGTGA 

2f GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 

'fr\ GTCAGAAGGTGCTGCTGAGGATAGACTGACTT TGGCCAT TGGATTGAGCAAAGGCAGAAATGGGG 

fjf GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 

Q TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 

x j$ GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 

S ; ! TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 

1 If 

GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GACTAATTT GTGCATGAACTGAAATAAAACCATCCTACGGTAT CCAGGGAACAGAAAGCAGGATG 
CAGGATGGGAGGACAGGAAGGCAGCCTGGGACAT T TAAAAAAATA 
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FIGURE 80 



MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 
IYSTLLGLPADIQAAQAMMVTSSAISSLACIISWG^CTVFCQESRAKDRVAVAGGVFFILGGL 
LGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCFSCSSQRNRSNYY 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein; 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, . 117-140, 163-182 

N-glycosylation site . 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 46-59 



App_ID=10063570 



Page 230 



FIGURE 81 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTC.CTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGG ATGG TTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
CCATGGGAAGAGATACTCCCCGGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 
ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 
ACT GCCAGACT CCTGC TGCCAAGCCT GCAAAGATGAGGCAAG T GAGCAATCGGAT GAAGAGGACA 
GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
CAGACCCAAGGGAGCAGGCAGCACAACTGTCAAGATCGTCCTGAAGGAGAAACATAAGAAAGCCT 
GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
CAC CGAGTACCCCTGCCGTCAGCCCGAGAAAGTGGC T GGGAAGTGC TGCAAGAT TT GC CCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
C T CGGACT TGGTGGAGATC T AC CT CT GGAAGCTGG TAAAAGAT GAGGAAAC TGAGGCT CAGAGAG 
GTGAAG TACCT GGCCCAAGGCCACACAGCCAGAATCT TC CACT TGAC TCAGAT CAAGAAAGTCAG 
GAAGCAAGAC T TCCAGAAAGAGGCACAGCAC T TCC GACT GCTC GCTGGCCCCCACGAAGGT CAC T 
GGAACGT C TTCCTAGCCCAGAC CC TGGAGCTGAAGGT CACGGC CAG TCCAGACAAAG T GACCAAG 
ACATAACAAAGACC TAACAGT T GCAGATATGAGCTG TATAAT T GT TGTTAT TAT AT AT TAATAAA 
TAAGAAGT T GCAT TAC CC TCAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 

MVPETOVLSSLLGLALLWFPLDSHARARPDM^^ 

GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 83 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GT GGCTTCCGCACC TTGAGCATT AGGCCAGTT CT CCTCTTCT CT CT AATCCAT CCGTC ACCT CT CCTGTCA 
TCCGTTTCCAT GCC GTGAGGTCCATTC ACAGAAC AC ATCCATGGCTCTC AT GCT C AGTTTGGTT CTGAGTC 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CCAGTTCTCTAGCGTGGTCCACCTCTACAGGGACGGGAAGGACCAGCCATTTATGCAGATGCCACAGTATC 
AAGGCAGG AC AAAACTGGTGAAGGAT TCTATT GC GG AGGGGCGC ATCTCT CT GAGGCTGG AAAAC ATT ACT 
GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 
^ AC AGGTGTCAGCACTGGGCT CAGT TC CTCT CAT T TCC ATC ACGGGATATGTT GAT AGAGACATCCAGC TAC 

j Zl TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 
'Zl ACAGACTCCAGGACAAACAGAGACATGCATGGCCTGTTTGATGTGGAGATCTCTCTGACCGTCCAAGAGAA 
I \\ CGCCGGGAGC AT AT CCTGT T CC AT GCGGCATGCT CATCTG AGCCGAGAGGTGGAATCC AGGGTAC AGATAG 

ff\ GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 
%\ TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG 
r | GAGAAGAAAGCACGGACAGGCAGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAG 
« AGACGGCT CACCCGAAGCTCT GCGTTTCTG AT CT GAAAACTGT AAC CC AT AGAAAAGC TCCCCAGG AGGT G 

CI CC TC ACTCTGAGAAGAGATT TACAAGGAAG AGTGTGGTGGCT T C TC AG AGTT TCC AAGCAGGGAAACATT A 

y I CT GGGAGGT GGACGGAGG ACAC AAT AAAAG GT GGCGCGTGGGAGTGTGCCGGGAT G ATGTGGACAGGAGG A 

^ AGGAGT ACGT GACT TTGT CT CC CGATC ATGGGTACT GGGTCCT CAG ACTG AATGGAGAACAT TTGT ATTT C 

ACATT AAATC CC CGT T TT AT CAGCGT CT TCCC CAGG ACCCCACCTACAAAAATAGGGGTCTT CC TGGACT A 
JJ ; j TGAGTGTGGGACCAT C TCCT TCTTCAACAT AAAT GACC AG T CCCTT ATTT AT ACCCTGACAT GT CGGTTT G 

AAGGCT T ATT GAGGCC CT ACAT TGAGT ATCCGTC CT AT AATGAGC AAAAT GGAACTCCCAT AGT C ATCTGC 
CCAGT C ACCCAGGAAT CAGAGAAAGAGGCCTCTT GGCAAAGGGCCT CTGC AAT CCC AG AGACAAGCAACAG 
TGAGT CCTCCTC AC AGGCAACCACGC CCTT CCT C CCCAGGGGTGAAATGTAGGATG AATCACAT CCC ACAT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GATGAAGGGGGACT GGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCT GAGCTGGGAGGGAAGAAGG 
CT GACATT ACAT TT AGTTTGCT CT CACTCC AT CT GGCT AAGT GATC TTGAAATACC ACCTCT CAGGTGAAG 
AACC GTCAGGAATT CCC ATCTC AC AGGCTGT GGT GT AG ATT AAGT AGACAAGGAATGTGAAT AATGCTT AG 
AT CT T ATTGATGACAGAGTGTATC CTAATGGT TT GTTC ATT ATATT ACACTT TC AGTAAAAAAA 
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FIGURE 8 J. 

MALMLS LVLS LLKLGSGQWQVFGPDKPVQALVGEDAAFS C FLSPKTNAEAMEVRFFRGQFS SWH 
LYRDGKDQPFMQMPQYQGRTKLVKDSIAEGRISLRLENITVLDAGLYGCRISSQSYYQKAIWELQ 
VSALGSVPLISITGYVDRDIQLLCQSSGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGLFDVEISL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKI QAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVS DLKTVTHRKAPQEVPHSEKRF 
TRKS WASQS FQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLS PDHGYWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
PIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLPRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 8 s 



AACAGACGT TCCCI'CGCGGCCCTGGCACC TC TAACCC CAGACATGC TGC TGC'i'GCT GCTGCCCCT 
GCTCTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACTGCTGACGATGCAGAGTTCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCT CGGGCAG TGTGGGAGGAGACT CGGGACC GAT T CCACCT CCTT G 
GGGACCCACATACCAAGAAT T GCACCCTGAGCATCAGAGATGCCAGAAGAAG TGAT GCGGGGAGA 
TACT TCT TT CGTATGGAGAAAGGAAGTATAAAATGGAAT TATAAACATCACCGGCT CT CT GTGAA 
H TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
y AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
% GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
Li GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
iff ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 
"* 5 J GACGGCACAG TATCCACAGT CT TGGGAAATGGC TCAT CT C TG T CAC T CCCAGAGGGCCAGT CTCT 

^ GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
% % GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
\fi CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 
Q GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 
JIJ CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
JJ ; ; TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 
AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCT T GGGACTCGCGGGGACAGGAGGCCAC TGACACCGAGTACTCGGAGAT CAAGATC CACAG 
ATGAGAAAC TGCAGAGAC TCAC CC TGAT TGAGGGATCACAGCC CCTCCAGGCAAGGGAGAAGTCA 
GAGGCTGATTCTTGTAGAATTAACAGCCCTCAACGTGAT GAGCTATGATAACACTATGAAT TATG 
TGCAGAGTGAAAAGCACACAGGCT TTAGAGTCAAAGTAT CTCAAACCTGAATCCACACTGTGCCC 
TCCCTTTTATTTTTTTAACTA2y^AGACAGACAAATTCCTA 
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FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWFREG 
MfTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLS IRDARRSDAGRYFFRMEKGS IKWNY 
KHHRLS VNVTAL THRPN I L I PGTLES GCPQNLTCS VPWACEQGT PPMI S W IGTS VS PLDPS TTRS 
SVLTL I PQPQDHGT S L TCQVT FPGA5VTTNKTVHLNVS YPPQNLTMTVFQGDGTVSTVLGNGS SL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGS QQVYLNVS LQSKAT S GVT QGWGGAGATALVFL S FCVI FVWRS CRKKS ARPAAGVGDTG I E 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
YSEIKIHR 

Signal peptide: 
amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 



AppJD=10063570 



Page 236 of 3 18 



FIGURE 87 



AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
CCAAGGAAAGTGCAGCTGAGACTCAGACAAGATTACAATGAACCAACTCAGCTTCCTGCTGTTTC 
TCATAGCGACCACCAGAGGATGGAG TACAGAT GAGGCTAATACT TACTTCAAGGAATGGACC TGT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
CAAC TACAACACC T T TGGAT CT GCAGAGGCGGCCACGAGCGAT GAC TACAAGAACCCT GGC TAG T 
ACGACATCCAGGGCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
AGAAACAGCTCCC TGCTGAGG TACCGCACGGACAC TGGC T TCC TCCAGACACTGGGACATAATCT 
GTTTGGCATCTACCAGAAATATCCAGTGAAATATGGAGAAGGAAAGTGTTGGACTGACAACGGCC 
CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
GGCCAGCGGGAAT TCAC TGCGGGAT T TGT TCAG T TCAGGG TAT TTAATAACGAGAGAGCAGCCAA 
CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 
GATACT T TCCAGAGGCCAG TCCCCAGCAGTG T GGAGAT TT TT C TGGT T T TGATT GGAGTGGATAT 
GGAACT CATGT TGGT TACAGCAGCAGCCGTGAGATAACT GAGGCAGCTGTGC T T CTAT TCTATCG 
T TGAGAGT T T T GTGGGAGGGAACCCAGAC GTC TCC TCCCAACCATGAGAT CCCAAGGATGGAGAA 
CAACTTACCCAGTAGCTAGAAT GT TAATGGCAGAAGAGAAAACAATAAATCATATTGACTCAAGA 
AAAAAA 
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FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWTDNGPVI PWYDFGDAQKTAS YYS PYGQRE FTAGFVQ FRVFNNERAANALCAGMRVTGCN 
TEHHC I GGGG YFPEAS PQQCGDFS G FDWS G YGTHVGYS SSREI TEAAVLL FYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site. 

amino acids 163-167 

Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 

N-myristoylation sites. 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 80 



CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTGAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTGAAATGGCTTTATTCGTCACCGTCTYTGGGCTGAAAAAGAAACCCTTC TGA TTACCTTCA 
TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
T TAAGACT TATATACAGT TT TAGGGGACAAT TAAAAAAAAAAAA 
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FIGURE qo 

MGRVSGLVPSRFLTLLAHLVWITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWECTTYWYIFVFCSALPAVTEMALFV 
TVFGLKKKPF 



Transmembrane domain: 

amino acids 12-28 {type II), 51-66, 107-124 
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FIGURE Qi 



CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAA GATG CAACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
CCAAC AC AAT GC CACAGG CC AG GGAAACAT C T C CAT C AGC C TCGTGCCCCC C AGT AAAGC T G TAG 
AGT TCCACCAGGAACAGCAGAT C TTCATCGAAGCCAAGGCCTC CAAAATC T TCAACT GCCGGATG 
GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 
CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
TCTACAT CGCC T TC TACAGCACGGACTAT CGGC TGGT CCAGAAGGTGTGCCCAGAT TACAACTAC 
CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 
TGGAAG G AC AGG CC T G C C CAT GCAG GAGAC CAT CT G GAC AC CG GGCAG GGAAGG GGT T G GG CC T C 
Q AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGT GGCAG 

LI! AGAAAGGGTCCCAAGTGC TGG TCC CAACC TGAAGC T GTGGAGT GAC TAGATCACAGGAGCAC TGG 

AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 
TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 



u 



m 



m CCT TGGTT CT TGCCATCC TGAGGAAAGATAGCAACAGGGAGGGGGAGATT TCAT CAG T GT GGACA 

GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCATCCTTAAGCTAAGACAGGACGATTGTGGTCCTCCCACACTAAGGCCACAGCCCATC 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLIAPPGEAWGILGQPPmPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SVHFQHNATGQGNI S I SLVPPSKAVEFHQEQQI FIEAKASKI FNCRMEWEKVERGRRTSLCTHDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE Q3 



CGGTGGCC ATGA CTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGAT GGACCAACACAGAAATATCTGCT GATC T TTGGAGCGTT TGTCTCTGTC TAT AT CCAAGAA 
ATGT T CCGAT TTGCATAT TATAAAC T CT TAAAAAAAGCCAGTGAAGGT T TGAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCTGAAACTCTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT 
AACCTCAGGGAACCAGCACT TC CCAAA.CCGCAGAC TACAT CT T TAGAGGAAGCACAACTGTGCCT 
TTTTCTGAAAATCCCTTTTTCTGGTGGAATTGAGAAAGAAATAAAACTATGCAGATA 
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FIGURE QA 

MTAAVFFGCAF I AFGPALALYVFT IAI EPLRI I FL I AGAFFWLVSLL I SSLVWFMARVI IDNKDG 
PTQKYLLI FGAFVSVYIQEMFRFAYYKLLKKASEGLKS INPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 



Important features of the protein: 
Signal peptide: 

U; amino acids 1-19 

01 Transmembrane domains: 

Ul amino acids 32-51, 119-138, 152-169, 216-235 

u! 

Glycosaminoglycan attachment site. 



a 
in 



Kit 



amino acids 120-123 

Sodium: neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE 

AAT T TT T CACCAGAGTAAACT T GAGAAACCAAC TGGACCT TGAGTAT T GTACAT T T TGCCTCGTG 
GACCC AAAGGTAGCAAT C TGAAACATGAGGAGTACGATT C TAC TGT TTTGTCTTC T AGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACACTAC CAAACCAACAGCAGT CAAATCAGG TC T T TC C TTC T TTAAGTC TGATACCATTAACA 
CAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGGAATGACACCTGGTAC 
CCAGACC CACCCATTGACCCT GGGAGGGT TGAATGTACAACAGCAACTGCACCCACATG T G T TAC 
CAAT T TT TGTCACACAAC T TGGAGCC CAGGGCACTAT CCTAAGCTCAGAGGAAT TGCC ACAAAT C 
TTCACGAGCCTCATCATCCAT TCCT TGTTCCCGGGAGGCATCCTGCCCACCAGT CAGGCAGGGGC 
TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
GCAGGCATCCAAAGGAGCACACATGCCATCGAGGAAGCCACCACAGAATCAGCAAATGGAATTCA 
G TAAG CTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
ATTGAT TATATTATGGAATAGATTGAGACACAT TGGATAGTCT TAGAAGAAATTAATTCT TAAT T 
TACCTGAAAATATTCTTGAAATTTCAGAAAATATGTTCTATGTAGAGAATCCCAACTTTTAAAAA 
CAATAATTCAATGGATAAATC T GTC T T T GAAATATAACAT TAT GCTGC CTGGAT GATATGCATAT 
TAAAACATATTTGGAAAACTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE q6 



MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQIFTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGVNPATQGTPAGRLPTPSG 
T DD D FAVT T PAG I QR S T HAI E E AT TE S ANG I Q 

Signal peptide: 
amino acids 1-16 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTGAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAG TACCAGCTAC AGGTCACCC T GGAGAT GCAGGAT GGACATGT CTT GTGGGGTCCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGATGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC 
CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
ACCAGCCT TGACCACGCCC T GGAGAGGACC TACCAGCTGT T GGTACAGGTCAAGGACATGGGTGA 
CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
CCCTAGAGCCTATCCACC TGGCAGAGAATC TCAAAG T CCTATACCCGCACCACATGGCCCAGG TA 
CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
TGCAGAGGGAAACC TC TACGTGACCAGAGAGC TGGACAGAGAAGCCCAGGC TGAG T AC CT GC TCC 
AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
CAGTCCACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCGCCGGCTCCCCCA 
AT TCC CACGT T G T G TAT CAGC T CCT GAGC C C T GAGCC TGAGG AT GGGG TAGAGG GGAGAGCC T T C 
CAGG TGGACCCCAC T TCAGGCAG TG T GACGCTGGGGGTGCTCCCAC T C CGAGCAGGCCAGAACAT 
CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
TCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCTGAGTTCATCACTTCCCAGATTGGGCCT 
ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
CCT CGAGCCCGCCT TCCGCCTCATGGAT T T TGCCAT T GAGAGGGGAGACACAGAAGGGACT T T TG 
GCCTGGAT TGGGAGCCAGACTCTGGGCATGTTAGAC T CAGAC TC TGCAAGAACCTCAGT TAT GAG 
GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
C GAAGCT G TCGGCAGTGGGCAT CC T T GTAGGCACCC T GGTAGCAATAGGAATC T TCC T CATCC T C 
AT T T T CACCCAC TGGACCAT G T CAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGTGCCCCT 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
T CCCCTGGGAGAGAGCCCAGCACCCAAGAT CCAGCAGGGGACAGGACAGAGTAGAAGC CCC TCCA 
TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE q8 



MVPAWLWLLCVSVPQALPKAQPAELSVEVPSNYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
S T SLDHALERT YQLLVQVKDMGDQASGHQATATVEVS 1 1 ESTWVSLE P IHLAENLKVL YPHHMAQ 
VHWSGGDVHYHLESHPPGPFEWAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 
P I S LPEDVE PGTLVAMLTAI DADLEPAFRLMD FAI ERGDTEGT FGL DWE P DSGHVRLRLCKNLS Y 
f!\ EAAPSHEVVVWQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQES YEAS VP I SAP AGS FLLT 

g|1 I QPS DP I SRTLRFSLVNDSEGWLC I EKFSGE VHTAQSLQGAQPGDT YTVL VEAQDTALXLAPVPS 

III QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
^ j REHI I PVWSHNAQMWQLLVRVI VCRCNVEGQCMRKYGRMKGl^TKLS AVG I LVGTLVAI GI FL I 

j:| LI FTHWTMSRKKDPDQPADSVPLKATV 

2 

Q Signal peptide: 

111 amino acids 1-18 

~~ Transmembrane domain: 

y . 

f|j amino acids 762-784 
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FIGURE qq 



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGTCCAAGAT TCTTCCCAGGAACACAAACGTAGGAGACCCACGCTCCTGGAAGCACCAGCCT TTA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATCTAGGAC CCAGGCATC T TGCT TTCCAGCCACAAAGAGACAGATGAAGATGGAGAAAGGAAAT G 
TTCTCCTTATGTTTGGTCTACTATTGCATTTAGAAGCTGCAACAAATTCCAATGAGACTAGCACC 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAGT CACCAAC T C TGAGT T CCATACAACCTCCAGTGGGATCAGCACAGC CACCAACTCTGAG 
TTCAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGG 
GGCCAGCACAGCCACCAACTCTGAGTCCAGCACACCCTCCAGTGGGGCCAGCACAGTCACCAACT 
y ; CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
' f ;% AGTAGGGCCAGCACTGCCACCAACTCTGAGTCTAGCACACTCTCCAGTGGGGCCAGCACAGCCAC 
2; CAACT CTGACT CCAGCACA&CCTCCAGTGGGGC TAGCACAGCCACCAACTC T GAGTCCAGCACAA 

y CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
01 GCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAG 
III AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
fw GCACAGC CACCAAC TC TGACTCCAGCACAGTGTCCAGTGGGGC CAGCAC TGC CACCAACTCTGAG 

!f j TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
]y GGCTAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGGGGCCGGCACAGCCACCAACT 
Q C TGAGTCCAGCACAGTGTCCAGTGGGATCAGCACAG T CACCAAT TCT GAG TCCAGCACACCCTCC 

3 AG T G GGGC C AACAC AG CCAC C AAC TCT GAG TCC AGTACGAC CT C C AG T GG GG CC AACACAG C C AC 

f% CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 

CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 
GCCACCAACTCTGACTCCAG»C^CCTCCAGTGAGGCCAGC^^^ 
Q CACAGTGTCCAGTGGGATCAGCACAGTCACCAAT TCTGAGTCCAGCACAACCTCCAGT GGGGCCA 

fj-J ACACAGCCACCAAC TC TGGGTCCAG TGTGACC TC TGCAGGCTC TGGAACAGCAGC T C T GACTGGA 

ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
K GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
* u TCTTCTTCTGTGTGAGAAA.CAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 

CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
G TGGAGT CC TAACT GG TTCT GGAGGAGACCAGTATCATCGATAGCCATGGAGAT GAGC GGGAGGA 
ACAGC GGGC CCTGAGCAGCCCCGGAAGCAAGTGCCGCAT TCT TCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AATCTTGAAGAAGGTATTCCTCACCTTTCTTGCCTTTACCAGACACTGGAAAGAGAATACTATAT 
TGCT CAT T T AGC TAAGAAATAAATACAT C TCATC T AACAC ACACGACAAAGAGAAGCT GT GC TT G 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CAT CT GGCATTCAAAATC T CCACAGTAAAATCCAAAGACCT CAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 100 



MKMQKGNVLLMFGLLLHLEAATNSNET S TSANTGS SVI SS GASTATNSGS S VT SSGVSTAT I SGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSSSSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
AVGL FAGL FFC VRNSL S LRNT FNT AVYHPHGLNHGLGPGPGGNHGAPHRPRWS PNWFWRRPVS S I 
AMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain : 

amino acids 510-532 
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FIGURE 101 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTGGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 
GGATGAT GTGATATATGCATTCCAGGGGAAGGGAAAT TGT GG T GCT TC TGAACCCATGGT CAAT T 
AACGAGGCAGTTTCTAGC TACT GCACGTACTTCATAAAGCAGGACTCTAAAAGCTTTGGAATCAT 
GGT GTCATGGAAAGGGAT T TACTT TATAC TGAC TC TGT T T T GGGGAAGCT T TT T TGGAAGCAT T T 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
GAT TATAACT GGGGATGCATT T GT TC CT GGAGAAAGAAGTGTCAT TATCATGAACCAT CGGACAA 
GAATGGACTGGATGTTCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATT 
TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
CT T CAT T CATAGGAAATGGAAGGAT GACAAGAGCCAT TTCGAAGACATGAT TGATT AC TT TT GTG 
ATAT TCACGAACCACTTCAACTCC TCATAT T CCCAGAAGGGACTGATC TCACAGAAAACAGCAAG 
TCT CGAAG T AATGCATT TGCTGAAAAAAATGGACT TCAGAAAT ATGAATATGT T T T ACAT CCAAG 
AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 
ATATCACTGTGGCGTATCC T CACAACAT T CCTCAAT CAGAGAAGCACC TCCTCCAAGGAGACTT T 
CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 
CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 
GGGAGAAGAAT TT T TAT T T TAC CGGACAGAG T G TCATTCCACCT TGCAAGT CTGAACT CAGGGTC 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATATT TGTACAGT C T TGT TAAGTGG TAT T T TATAATCAC CAT T GTAATC T T TG TGC TGCAAGAGA 
GAATATTTGGTGGACTGGAGAT CATAGAACTTGCATGTTACCGACT TTTACACAAACAGCCACAT 
T TAAATT CAAAGAAAAAT GAG TAA GAT TATAAGGT T TGCCATGTGAAAACC TAGAGCATAT T T TG 
GAAATGT TC TAAACC TT TCTAAGCT C AGATGCAT T T TTGCATGAC TATGTCGAATAT TTCT TACT 
GCCATCAT TATTTGT TAAAGATAT T T TGCAC T TAATT TT GT GGGAAAAATATTGCTACAAT T TT T 
TTTAATCTCTGAATGTAATTTCGATACTGTGTACATAGCAGGGAGTGATCGGGGTGAAATAACTT 
GGGCCAGAATAT TAT TAAACAATCATCAGGC T T T TAAA 
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FIGURE 102 



MHSRGRE I WLLNPWS INEAVSS YCTYFIKQDSKSFGIMVSWKGIYFI LTLFS7GS FFGS I EMLSP 
FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVI I TGDAFVPGERS VT IMNHRTRMDWM 
FLWNCLMRYSYLRLEKICLKA.SLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVIPPCKSELRVLWKLLSILYWTLFSPAMCLLIYLYSLVKWYFIITIVIFVLQERIFGG 
LEI IELACYRLLHKQPHLNSKKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
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FIGURE 103 



CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCC AAAT CATCCATCCACCG C T GC TGTCATC TGTTT T CAT AG T GTGAGATCAACCCACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACCTCTACAGAGATGGGGAAGACTGGGAATCTAAGCAGATGCCACAGTATCGAGGGAGAACTGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
! ss : AGGATTTGTCTTCAGACTCCAGAGCAAATGCAGATGGGTACAGCCTGTATGATGTGGAGATCTCC 
% AT T ATAGT CCAGGAAAATGCTGGGAGCATAT TG TG T T CCATCCACC TTGC TGAGCAGAGT CATGA 

■jjjj GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
Q TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
01 AAATCCAAAGGGAAAATCCAGGCGGAAC TGGAC TGGAGAAGAAAGCACGGACAGGC AGAAT T GAG 

i j [ AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
?~ T T T C T GAT C T GAAAAC T G TAAC CCAT AGAAAAGC TCC CCAG GAG GT GC C T C AC T C T GAGAAGAGA 

P \ TTTACAAGGAAGAGTGTGGTGGCTTCTCAGGGTTTCCAAGCAGGGAGACATTACTGGGAGGTGGA 
'H CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
Q AT G T G AC T T T GT C T C CCAAC AAT GG GT AT T G GG T C C T CAGAC T GACAACAGAAC AT T T G TAT T T C 

ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
^ GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
h :f TGACATGTCAGTTTGAAGGCTTGTTGAGACCCTATATCCAGCATGCGATGTATGACGAGGAAAAG 
Uf GGGACTCCCATATTCATATGTCCAGTGTCCTGGGGATGAGACAGAGAAGACCCTGCTTAAAGGGC 
C! CCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGCTCCCGACAGGTGGCCCCAGCTTCCTCT 
f|j CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
^ TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
TCAGAAGCCATGGC T GCCCTGAAGTGGGGACGGAATAGACT CACAT TAGGT TTAGTT T GT GAAAA 
01 CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 
ACAG T GAT T CCTGCCTCACAGGTGAAGAT TAAAGAGACAACGAATG TGAAT CAT GCT T GCAGGT T 
T GAGGGCACAGTGT T TGC TAAT GATGTGT TT T TAT AT TATACAT T T T CCCACCATAAACT C TGT T 
TGC T TAT TCCACAT TAAT T TACTT T T CT C TATACCAAAT CACCCATGGAATAG T TAT T GAACACC 
TGC T T TGT GAGGCTCAAAGAATAAAGAGGAGGTAGGATT T T TCACTGAT TCTATAAGCCCAGCAT 
TACCT GATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTC CATATCC 
C TC AT T AAC ACAGACACAAAAAT T CT AAATAAAAT TT TAAC AAAT TAAAC T AAACAAT ATAT T T A 
AAGAT GATATATAACTAC TC AGTGTGGT T TGTCCCACAAATGCAGAGT T GGTT TAATAT TTAAAT 
ATC AACCAGTGTAAT TCAGCACAT TAATAAAGTAAAAAAGAAAACCATAAAAAAAAAAAAAAA 
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MAFVL I LVLS FYEL VS GQWQVT GPGKFVQALVGE DAVFS CS L FPE TSAEAMEVRFFRNQ FHAWH 
LYRDGEDWESKQMPQYRGRTEFVKDS I AGGRVSLRLKNITPSDI GLYGCWFSSQI YDEEATWELR 
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 
I VQENAGS I LCS I HIAEQSHEVE'SKVLIGETFFQPS PWRLAS I LLGLLCGALCGWMGMI IVFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKS WAS QG FQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVTL S PNNGYWVLRLT TEHL YFT 
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
w TPIFICPVSWG 

*\ 

|| Signal peptide: 

\\ amino acids 1-17 

s ; 

d • • ' 

4 Transmembrane domains: 

amino acids 131-150, 235-259 

3 • 

n . 

s;j ■ 

?! 

2 i 

II ■ 

n • 
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CCTT CACAGGAC T C TTCATT GCTGGT TGGC AAT6ATGT ATCGGCCAGATGTGGT GAGGGCT AGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CTCACTGTTCATTATGTGAGATATAATCAAAAGAAGACCTACAATTACTATAGCACATTGTCATTTACAAC 
TGACAAACTATATGCTGAGTTTGGCAGAGAGGCTTCTAACAATTTTACAGAAATGAGCCAGAGACTTGAAT 
CAAT GGTG AAAAAT GC ATTT TAT AAATCTCCATTAAGG GAAGAATTTGTCAAGT CTCAGGTT AT C AAGTTC 
AGTC AACAGAAGCATGGAGT GT T GGCTC AT AT GCTGTT GATT TGT AGATT T C ACTCT ACTGAGGAT CCTG A 
AACTGTAGATAAAATTGTTCAACTTGTTTTACATGAAAAGCTGCAAGATGCTGTAGGACCCCCTAAAGTAG 
ATCCTCACTCAGTTAAAATTA2\AAAAATC2\ACAAGACAGAAACAGACAGCTATCTAAACCATTGCTGCGGA 
AC ACGAAG AAGT AAAACTCT AGGT C AGAGTCT CAGG AT CGTTGGTGGG AC AGAAGT AG AAGAGGGT GAATG 
GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 
T T GT GAGT GCT GCT C ACTGT T T T AC AACAT AT AAGAACCCTGCC AG ATGG ACTGCTTCCT TT GGAGT AAC A 
ATAAAACCTTCGAAAATGAAACGGGGTCTCCGGAGAATAATTGTCCATGAAAAATACAAACACCCATCACA 
T GAC T ATG AT AT T T CTCTT GCAGAGCTTTCTAGCCCTGTTCCCT AC AC AAATGCAGTACAT AGAGT TTGT C 
T C CCTGATGC AT CC T ATG AGTT T C AACCAGGT GATGTG ATGT TT GTGACAGGAT TTGGAG C ACT GAAAAAT 
GATGGT TACAGT CAAAATCAT CT T CG ACAAGC ACAGGT GACT CT C ATAGACGCTAC AACTTGCAATGAACC 
T C AAGCTT AC AAT GACGCCATAAC TCCT AG AATGTT AT GTGCTGGCTCCT TAGAAGGAAAAACAGATGC AT 
GCCAGGGTGACTCTGGAGGACCACTGGTTAGTTCAGATGCTAGAGATATCTGGTACCTTGCTGGAATAGTG 
AGCTGGGGAGATGAATGTGCGAAACCCAACAAGCCTGGTGTTTATACTAGAGTTACGGCCTTGCGGGACTG 
GATTACTTCAAAAACTGGTATCTAAGAGACAAAAGCCTCATGGAACAGATAACATTTTTTTTTGTTTTTTG 
GGTGTGGAGGCC AT TTTT AGAGATAC AG AATT GGAG AAGACT TGC AAAAC AGCT AGATTT GACT GATCTCA 
ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GATC AACTCT GT CATCTGTGAGCAAT AGTT GAAACT TT ATGT AGAT AG AG AAAT AGAT AATACAAT ATT AC 
ATTACAGCCTGTATTCATTTGTTCTCTAGAAGTTTTGTCAGAATTTTGACTTGTTGACATAAATTTGTAAT 
GCAT AT AT ACAATT TGAAGC AC TCCTTTTCTT CAGT TCCTCAGC T CCTCTCATT TC AGCAAATATC CATT T 
TCAAGGTGCAGAACAAGGAGTGAAAGAAAATATAAGAAGAAAAAAATCCCCTACATTTTATTGGCACAGAA 
AAGTATTAGGTGTTTTTCTTAGTGGAATATTAGAAATGATCATATTCATTATGAAAGGTCAAGCAAAGACA 
GCAGAATACCAATCACTTCATCATTTAGGAAGTATGGGAACTAAGTTAAGGAAGTCCAGAAAGAAGCCAAG 
ATATATCCTTATTTTCATTTCCAAACAACTACTATGATAAATGTGAAGAAGATTCTGTTTTTTTGTGACCT 
AT AAT AAT TATACAAACTTCATGC AATGTACT T GTTCTAAGCAAATTAAAGC AAAT ATTT AT TT AAC ATTG 
TT AC T GAG GATGTC AACAT ATAACAATAAAAT AT AAATCACCCA 
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MMYRPDWRARKRVC WE PWVI GLVI FI SL I VLAVC I GLT VHYVRYNQKKT YNYYS TLS FTTDKLY 
AE FGREASNNFTEMSQRLE SMVKNAFYKS PLREE FVKS QVIKFSQQKHGVLAHMLL I CRFHS TED 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTASFGVTIKPSKMKRGL 
RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVTL I DAT TCNE PQAYNDAI T PRMLCAG S LEGKTDACQGDSGGPLVS SDARDI WYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAATGTTCTCACTTTAACTGCAGAAACGAGGGTAGAGGAAGCAGTCATTTTGACTTACTTTCCT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
GATAT GGT CACTT TGAAAGCCAGGAT GACAAAT TATGGATT ACCTAGATATCGGT GGCT TACTCA 
TGCTTGGAATTTTTTTCAGAGAGAGTTTAAGTGCTGTGGAGTAGTATATTTCACTGACTGGTTGG 
*»j AAATGACAGAGATGGACTGGCCCCCAGATTCCTGCTGTGTTAGAGAATTCCCAGGATGTTCCAAA 
CAGGCCCACCAGGAAGATCTCAGT GACC T T TATCAAGAGGGTT GTGGGAAGAAAAT GT AT TCC T T 
y TTTGAGAGGAACCAAACAACTGCAGGTGCTGAGGTTTCTGGGAATCTCCATTGGGGTGACACAAA 
yl TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
m ACAGACCAAATGATGTCC T T GAAGAATGACAAC TC TCAGCACCT GT CATG TCCC TC AGTAGAACT 

it? GT T GAAACCAAGCC TGTCAAGAAT C TT TGAACACACATCCATGGCAAACAGC TT TAAT ACACACT 

:f ) • T TGAGAT GGAGGAGT TATAAAAAGAAATGT CACAGAAGAAAACCACAAAC T TGT TT TATT GGACT 

2j TGT GAAT T TT TGAGTACATACT AT G TGT TTCAGAAATAT G TAGAAATAAAAATGT TGCCATAAAA 

CI TAACACCTAAGCATATAC TAT T CT AT GCT T TAAAATGAGGATGGAAAAGT T TCATGTCATAAGTC 

B ACCACCT GGACAATAATT GATGCCC T TAAAA.TGC TGAAGACAGATG T CATACCCACT GTGTAGCC 

"jw^j TGT GT AT GACTT TTAC TGAACACAG T TATGTT TTGAGGCAGCATGG T T TGAT TAGC AT T T CCGCA 

% TCCAT GCAAACGAGTCACAT AT GGTGGGAC TGGAGCCATAGTAAAGGT T GAT T TACT T CTACCAA 

jjjj C TAGT ATATAAAGTAC TAATTAAATGCTAACATAGGAAGT T AGAAAATAC TAATAACT T T TATTA 

Cl C TCAGCGATCTAT TCTTC TGAT GCTAAATAAAT TATATATCAGAAAAC T T T CAATAT T GGT GACT 

ffj ACC TAAATGTGATT T T TGCTGGTT AC TAAAATATTCT TACC AC TTAAAAGAGCAAGCT AACACAT 

f?j TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
J!!'; T TCAGT T CT GATAATG T TAAGAAT AACCATTAT GAAAAGGAAAATT TGTC C TGTATAGCATCATT 

— ATT TTTAGCCTT TCCTGT TAATAAAGCT TTACTAT TCTGTCCTGGGCTTATAT TACACATATAAC 

TGT TAT T TAAATAC T TAACCACTAAT TT TGAAAAT TACCAGTGTGATACATAGGAATCAT TAT TC 
AGAAT G TAGTC TGGTCT T TAGGAAGTAT TAATAAGAAAAT TTGCACAT AAC TTAG T TGAT TCAGA 
AAGGAC T TGTATGC TGT T TTTCTCCCAAATGAAGACT CT T TT TGACAC TAAACAC T T T T T AAAAA 
GCT TATC T TTGCCTTCTCCAAACAAGAAGCAATAG TC TCCAAGT CAATATAAAT TCTACAGAAAA 
TAGTGTT C T T T T TCTCCAGAAAAAT GCT TGTGAGAATCAT TAAAACATGTGACAAT T T AGAGAT T 
C T T TGT T TT AT TTCACTGAT TAATATACTGTGGCAAAT TACACAGAT TAT TAAAT T T T T T TACAA 
GAGTATAGTATATTTATTTGAAATGGGAAAAGTGCATTTTACTGTATTTTGTGTATTTTGTTTAT 
T TCT CAGAATAT GGAAAGAAAATTAAAATGTGTCAATAAAT AT T T TC TAGAGAGTAA 
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MARE DS VKC LRC LL YALNLL FW LMS I S VLAVS AWMRD YLNNVL T L TAE T RVEEAVI L T Y FP WHP 

VMI AVCCFLI I VGMLGYCGTVKRNLLLLAWYFGS LLVI FCVELACGVWT YEQE LMVPVQWS DMVT 

LKARMTNYGLPRYRWLTHAWNFFQREFKCCGVVYFTDWLEMTEMDWPPDSCCVR^ 

EDLS DLYQEGCGKKMYS FLRGTKQLQVLRFLG I S I GVTQI LAMI LT I TLLWALY YDRRE PGTDQM 

MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
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CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GAT AAAGT CT T TC TC GT GAGAGCCT AGAGGCCTT AAAAAAAAAAGT GCT T GAAAGAGAAG GGGAC AAAGGAACA 
CC AGTAT TAAGAGGATT TT CCAGT GT T TCT GGC AGT T GGTCCAGAAGGATGCCT CC ATT CCT G CTTC T C AC CT G 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCT GGAGGAACACTGACCACCAGTTGGATGAGTCT CAAGGTCCTCCTCTATGTGACAACC ATGT GAAT GGGGKG 
TGGTACCACTTCACGGGCATGGCGGGAGATGCCATGCCTACCTTCTGCATACCAGAAAACCACTGTGGAACCCA 
CGC ACCT GTC T GGCT C AAT GGCAGC C ACCCCC T AGAAGGC GAC GGCAT T GT GC AAC GCCAGGC T T GT GCC AGC T 
TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
foh CT GAC C AAGC CC AGC GT CT GCT T CCAC GT CT ACT GT GGT C ATT T TT AT GACATCT GC GAC GAGGACT GCC AT GG 

Q CAGCTGCT CAGATACCAGC GAGTGCACATGCGCTCCAGGAACT GTGCTAGGCCCTGACAGGC AGACAT GCTT T G 

"W ATGAAAATGAATGTGAGCAAAACAACGGTGGCTGCAGTGAGATCTGTGTGAACCTCAAAAACTCCTACCGCTGT 
^1 GAGTGT GGGGTTGGCCGTGTGCTAAGAAGT GATGGCAAGACTT GTGAAGACGTT GAAGGATGCC ACAATAAC AA 

^ TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 

m 

f ; CTGAGGATAACCACACTTGCCAAGTCCCTGTGTTGTGCAAATCAAATGCCATTGAAGTGAACATCCCCAGGGAG 
CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 
CCT CTTCT CTCTCAAGACATGTGGTACAGTGGTCGAT GT GGT GAAT GACAAGATT GT GGCCAGCAACCTCGT GA 
^ CAGGTCT ACCCAAGC AGACCCCGGGGAGCAGCGGGGACT TCATCAT CCGAACCAGCAAGCTGCT GATC CCGGTG 

ACCTGCGAGTTTCCACGCCTGTACACCATTTCTGAAGGATACGTTCCCAACCTTCGAAACTCCCCACTGGAAAT 
J^j CAT GAGC C GAAAT C ATGG GAT CTTCCCATTCACTCT GGAGATCT T C AAG GAC AAT GAGTT TGAAGAGC CTT ACC 

«S I GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGT GAGC GGC 

f% TTGGAAAGCTT GGT GGAGAGCTGCT T T GC C ACC CC CACC T CC AAGAT C GAC GAGGT CCTGAAATACTACCTC AT 

f|| CCGGGATGGCTGTGTTTCAGATGACTCGGTAAAGCAGTACACATCCCGGGATCACCTAGCAAAGCACTTCCAGG 
TCCCTGTCTTCAAGT TTGT GGGCAAAGACC ACAAGGAAGTGTT TCT GCACTGCC GGGTTCTTGTCTGT GGAGTG 
TT GGAC GAGC GTT CCCGCTGTGCCCAGGGTTGCCACC GGC GAAT GC GTC GTGGGGCAGGAGGAGAGGACTC AGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGACTAGTTCGTAGCCATACCTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
CAGGTC AC AGC ACTGCTGAAC AATGTGGCCT GGGT GGGGTTTC ATCTTTCTAGGGT TGAAAACTAAACTGTCC A 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
C AC AAAAT CAG AAGC T GGGT AT AAT AT TT C AAGT T ACAAACCCTAGAAAAAT T AAAC AGT T ACT GAAATTAT GA 
CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 
TT AT AGGGAAT TT GGAAGT GT AT C AAT AAAACAGT AT AT AATT T T 
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MPPFLLLTCLFI TGT SVS PVALDPCSAY I SLNE PWRNT DHQLDE SQGPPLCDNH VNGEWY H FTGMAGDAMP 
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGH FYD ICDE DCHGSCSDTSE CT CAPGTVLGPDRQTC FDENEC EQNNGGCS E ICVNL KNS Y RC ECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHWILFSLKTCGTWDVVNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLYFGIEPWHV 
SGLESLVESCFATPTSKIDEYLKYYLIRDGCVSDDSVKQYTSHDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

Important features of the protein: 
Signal peptide: 

amino acids 1-16 

N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites. 

amino acids 411-418, 443-451 

N-myristoylation sites . 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxy lation site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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GAGAGAGGCAGCAGCTTGCTCAGCGGACAAGGATGCTGGGCGTGAGGGACCAAGGCCTGCCCTGCACTCGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CCTGCTGCCTTGGGGTGACAATCTCAGCTCCAGGCTACAGGGAGACCGGGAGGATCACAGAGCCAGCATGT 
TACAGG AT CC TGACAGTG ATCAACCT CTGAACAGCCTCGATGTC AAACCCCT GCGC AAACCCCGT ATCCCC 
ATGGAGACCTTCAGAAAGGTGGGGATCCCCATCATCATAGCACTACTGAGCCTGGCGAGTATCATCATTGT 
GGTTGTCCT C AT CAAGGTGATTCT GGAT AAAT ACT ACTTCCT CT GCGGGC AGCC TC TCCACT T C ATCCCG A 
GGAAGC AGCTGT GT GACGGAGAGC TGGACT GT CCCTTGGGGGAGGACGAGGAGC AC TGTGTC AAGAGCTT C 
CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
AGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 
GCTAC AGCAGAGCT GTGG AGATTGGCCCAG AC CAGGAT CTGGAT GTTGTTG AAAT C AC AGAAAAC AGCCAG 
GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
TC AGC ATCCAGT AC GACAAACAGCACGTCTGT GGAGGG AGCATCC TGGACCCCCACTGGGTCC T CACGGC A 
GC CC AC TGCTTC AGGAAACAT ACCGATGT GTT CAACTGGAAGGT GCGGGC AGGC TCAGACAAAC TGGGC AG 
CT TCCC ATCCCT GGCTGTGGCCAAGATCATCATCATTG AATT CAACCCCAT GTACCCC AAAGACAAT G ACA 
TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
GATGAGGAGCTCAC TCCAGCCACC CC ACT CTGGATC AT TGGAT GGGGCTT TACGAAGC AG AATGGAGGGAA 
GATGT C TGAC AT AC TGCTGC AGGC GT C AGTCCAGGT C ATTGACAGC AC ACGGTGCAATGC AGACGAT GCGT 
ACCAGGGGGAAGTCACCGAGAAGATGATGTGTGCAGGCATCCCGGAAGGGGGTGTGGACACCTGCCAGGGT 
GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 
CTGCGGGGGCCCGAGCACCCCAGGAGTATACACCAAGGTCTCAGCCTATCTCAACTGGATCTACAATGTCT 
GGAAGGCTGAGCTG TAAT GCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGATCCCCCAAAGTCAGACACAGAGCAAGAGTCCCCTTGGGTACACCCCTCTGCCCACAGCCTCAGCAT 
TTCTTGGAGCAGCAAAGGGCCTCAATTCCTGTAAGAGACCCTCGCAGCCCAGAGGCGCCCAGAGGAAGTCA 
GCAGCCCTAGCTCGGCCACACTTGGTGCTCCCAGCATCCCAGGGAGAGACACAGCCCACTGAACAAGGTCT 
CAGGGGTATTGC TAAGCC AAGAAGGAACTT TC CC AC ACT ACT GAAT GGAAGC AGGCTGTCTT GTAAAAGCC 
CAGATCACTGTGGGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAA 
GC CT ACTAGAGC AAGAAAC CAGTT GT AATATAAAATGC ACTGCC C T ACTGT T GGT ATGACT ACC GTT ACCT 
AC TGTTGTCATT GT T ATT ACAGCT ATGGCC AC TATT AT T AAAGAGCTGTGT AACATCTCTGGCAAAAAAAA 
AAAA 
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MLQDPDS DQPLNS LDVKPLRKPRI PMET FRKVG I P 1 1 1 ALLSLAS 1 1 1 VWL IKVTLDKYY FLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVEIGPDQDLDVVEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KTPRWGGEEASVDSWPWQVSIQYDKQHVCGGS ILDPHWVLTAAHCFRKHTDVFNWKVRAGSDKL 
GSFPSIAVAKIIIIEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWG 

FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPIiMYQS 
DQWHWG I VS WGYGCGGPST PGVYTKVS AYLNW I YNVWKAE L 

Transmembrane domain: 

amino acids 32-53 (typell) 
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GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACMCTTTTTC^GC^CTAAiy^AGCCACAGGAGT 
TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
pi GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
U| ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
f\ CAGAAGGGGTCTGCTTAGACCACCTGGTTTATGTGACAGGACTTGCATTCTCCTGGAACATGAGG 
GAACGCCGGAGGAAAGCAAAGT GGCAGGGAAGGAACTTGTGCCAAATTATGGGT CAGAAAAGATG 
GAGG TGTTGGGT TAT C AC AAGGC AT C GAG T C T C C T GC AT T C AG T GGAC AT G TG G GG GAAG GGC T G 
Q CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
If] CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
Jf; TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
C TC CTTCCCTCTGAGAGGCCC T CC TATGTCCCTAC TAAAGCCACCAGCAAGACATAGCTGACAGG 
fJI GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 



GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
CTCAATTTAAATCATGTTCTAGTAATTGGAGCTGTCCCCAAGACCAAAGGAGCTAGAGCTTGGTT 
CAAATGATC TCCAAGGGC CCT TATAC CCCAGGAGACT T T GAT T TGAATT TGAAACCCCAAATCCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGCTACTCGGGAGGCTGAGACAGGAGAATTACTTGAACCTGGGAGGTGAAGGAGGCTGAGACA 
GGAGAATCACTTCAGCCTGAGCAACACAGCGAGACTCTGTCTCAGAAAAAATAAAAAAAGAATTA 
TGGTTATTTGTAA 
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FIGURE 114 



MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 
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FIGURE lis 

CAGCAGTGGTCTCTCAGXCCTCtCAAAGCAAGGAAAGAGTACTGTGTGCTGAGAGACCATGGCAA 
AGAAT CC TCCAGAGAAT TGT GAAGACTGTCACATT CT AAATGCAGAAGCT TTTAAATCCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GT T T T GGGGGAGCAAGCACTTC TGGCCGGAGGTACCCAAAAAAGCC TATGACAT GGAGCACAC T T 
TCTACAGCAATGGAGAGAAGAAGAAGAT T TACATGGAAATTGATCCTGTGACCAGAACTGAAATA 
T TCAGAAGC GGAAATGGCACTGAT GAAACAT T GGAAGTGCACGACT T TAAAAACGGATACACTGG 
CAT CTACTT CGTGGGTCT TCAAAAAT GT T T TATCAAAACT CAGAT TAAAGTGAT TC CTGAAT T T T 
C TGAACC AGAAGAGGAAATAGATGAGAATGAAGAAATTACCACAAC T T TC TT TGAACAGT CAGT G 
ATTTGGGTCCCAGCAGAAAAGCCTATTGAAAACCGAGATTTTCTTAAAAATTCCAAAATTCTGGA 
GAT T T GT GATAACGTGACCATGTAT T GGATCAAT CCCAC TCTAATAT CAG T T TC TGAGTT ACAAG 
ACT T T GAGGAGGAGGGAGAAGATC T T CAC T T TCC TGCCAAC GAAAAAAAAGGGATT GAACAAAAT 
GAACAGT GGGTGGTCCC TCAAGTGAAAGTAGAGAAGACC CGTCACGCCAGACAAGCAAGT GAGGA 
AGAAC TT CCAATAAATGACTATACT GAAAATGGAATAGAAT T T GATCCCATGCT GGATGAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
GGC TACTACCCATATCCATAC T GCTACCAAGGAGGACGAGT CATC TGTCG TGTCAT CATGCCT TG 
TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC 
TGC TGGCAACATATAATAAATGCATGCTAT TCAAT GAAT TT CT GCCTATGAGGCAT CT GGCCCC T 
GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
TTATCACCAAAAAAAAAAAAAAAAAAA 
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FIGURE 116 



m 



MAKNPPENCEDCH I LNAEAFKSKKI CKS LK I CGLVFG I LALTL I VL FWGSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
EFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWINPTLISVSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCC I YCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVI CRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-40 



III Transmembrane domain: 

y*| amino acids 25-47 (type II) 



w 
W 

m 

Cj Glycosaminoglycan attachment sites. 



N-glycosylation sites. 

amino acids 94-97, 180-183 



amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Micro-bodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 
CCGTGT TCCAGT ACGAAGGGCT CT GG AGGAGCTGCGTG AGGC AGAGTT C AGGCT TC ACCG AATGCAGGCCC 
TATTTCACCATCCTGGGACTTCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 
TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
J* GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
%l TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA 
i» GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
U\ TG GGT CC AAC ACCAAAAACAAGAAGATAT ACGATGG AGGTGC CCGC ACAG AGGACGAGGT AC AAT CTT ATC 

y] CT TCCAAGCACGAC T ATGTGTAATGCTCTAAGACCTCT CAGCACGGGCGG AAGAAACT CCCGGAGAGCTC A 

%l CC CAAAAAAC AAGGAG ATCC CATCT AGATT TCTTCTTGCTT T TGACTCAC AGC T GG AAGTT AGAAAAGCC T 

Q CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 
^ GCTGAGTTATTTATGAATTAGAGGCTATAGCTCACATTTTCAATCCTCTATTTCTTTTTTTAAATATAACT 
fti TTCTACTCTGATGAGAGAATGTGGTTTTAATCTCTCTCTCACATTTTGATGATTTAGACAGACTCCCCCTC 
'2* TTCC TCCT AGTC AAT AAACCCAT T GATGAT CT AT TT CCCAGC TT ATCCCC AAGAAAACTTTT GAAAGGAAA 

jS* GAGTAG ACCC AAAGATGT T ATTT T CTGCTGTT TGAATTTTGT CT CCCCACCCCC AACTTG GCT AGT AATAA 

*f*\ AC AC T T ACTGAAGAAG AAGC AATAAG AGAAAGAT ATTTGT AATC T CTCC AGCCC ATGATCTCGGTT TTCT T 

f\\ ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 
TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 
CGCGGGTCAGAAAT TGTCCCTAGATGAATG AGAAAATT ATTT TT TTTAATTT AAGT CCTAAAT AT AGTTAA 
AATAAAT AAT GT TT T AGT AAAAT GAT ACACTATC TCTGTG AAAT AGCCTC ACCC CTAC ATGT GGAT AG AAG 
GAAATG AAAAAAT AATTGCT TTGAC ATTGT CTAT ATGGTACT TT GTAAAGTC AT GCTT AAGT AC AAATTCC 
AT GAAAAGCT CACACCTGT AAT CCT AGC ACTT TGGGAGGCT GAGGAGGAAGGAT CACTTG AGCC C AGAAGT 
TC GAGACT AGCCT GGGCAAC AT GGAGAAGCCCTGTCTCTACAAAAT ACAG AGAGAAAAAATCAGCC AGTC A 
TGGT GGCAT ACACC TGTAGT CCCAGC ATT CCGGGAGGCTGAGGT GGGAGG AT CACTTGAGCCCAGG GAGGT 
TGGGGCTGCAGT GAGCCATGATCAC ACCACTGCACT CC AGCC AGGTGACATAGC GAGATCCT GT CT AAAAA 
AATAAAAAATAAATAATGGAACACAGCAAGTCCTAGGAAGTAGGTTAAAACTAATTCTTTAA 
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FIGURE 118 



MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFT ILGLPAMLQAVRALMIVGIVLGAIGLLVS I FALKCIRI GSMEDSAKANMTLTSGIMFIVSGL 
CAI AGVS VFAKMLVTNFWMSTANMYT GMGGMVQTVQTRYT FGAALFVGWVAGGLTL I GGVMMC I A 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 
V 

Signal peptide: 

amino acids 1-23 



fi^ Transmembrane domains : 

l t \ amino acids 81-100, 121-141, 173-194 
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FIGURE HQ 

GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTGTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT 
CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
ACTCAATAGTGAATGTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCA 
CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
CAGATACT CGAT ACCTT CCCATCGCACAACCCAAAAAAGT TATCACACCGGAAAGAAGT CACCGAGCG 
TCTACTCCAGAAGTCAGTATGTGTAGTTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 
ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
AATCTTAATTACAGGAACTGTGCATCAGCTATTTATGATTCTATAAGCTATTTCAGCAGAATGAGATA 
TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 
CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 
ACGACATAGCAT TATGTACATAGATGAGTGTAACAT TTAT ATCTCACAT AGAGACATGCTTATAT GGT 
TTTATTTAAAAT GAAATGCCAGT CCAT TACACTGAATAAATAGAACTCAACTATTGCTT TT CAGGGAA 
ATCATGGATAGGGTTGAAGAAGGTTACTATTAATTGTTTAAAAACAGCTTAGGGATTAATGTCCTCCA 
T TTATAATGAAGATTAAAAT GAAGGCT TTAAT CAGCATTGTAAAGGAAATTGAAT GGCT TT CTGATAT 
GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTT GTGTATTAAATTAACATTTT TAAAACGCAGATATTTT GTCAAGGGGCT T TGCATT CAAACT GCTT 
TTCCAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATGGTTTTAGGAAAGTG 
AAAATATT T TTGTTTTT GTATT T GAAGAAGAATGAT GCAT TTTGACAAGAAATCATATATGTATGGAT 
ATATTTTAATAAGTATTTGAGTACAGACTTTGAGGTTTCATGAATATAAATAAAAGAGCAGAAAAATA 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATGCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TAT ACATTTATATTAAT AAAT TGTACATTTT TCTAAT T 
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FIGURE 120 

MM 1 HALE I AGL FLGGVGMVGTVAVTVMPQWRVSAFI ENN I 

IYDSLIALSPDLQAARGLMCAASVMSFLAFMMAILGMKCTRCTGDNEKVKAHILLTAGIIFI 
MWLIPVSWANAIIRDFYNSIVWAQKRELGEALYLGWTTALVLIVGGALFCCVFCCNEKSSSY 
RYS I PSHRTTQKS YHTGKKSPSVYSRSQYV 

Signal peptide: 

amino acids 1-17 

Transmembrane domains : 

amino acids 82-101, 118-145, 164-188 



AppJD=10063570 



Page 270 



FIGURE 121 



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACC TGTATAAT GGAATGT GCT TACAAGGGCCAGCAGGAG TGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GAT TCAAAGGAGAAAAGGGGGAATGTCTGAGGGAAAGCTT TGAGGAGTCCTGGACACCCAACTAC 
AAGCAGTGTTCATGGAGT TCAT TGAATTATGGCATAGAT CT T GGGAAAAT T GCGGAG TG TACAT T 
TACAAAGATGCGTT CAAATAG T GCTCTAAGAGT T T TGT T CAG T GGC TCAC T TCGGCTAAAAT GCA 
GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
ATT GAAGC TATAAT T TAT T T GGACCAAGGAAGCCC TGAAAT GAATTCAACAAT TAATAT T CATCG 
CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC 
ATTAT TGAAGAACTACCAAA ATAAA T GCT T TAAT T T T CAT T TGCTACC TC T TT T T T TAT TAT GCC 
T TGGAAT GGT T CAC TTAAAT GACAT T TTAAATAAG T T TATGTATACATCTGAAT GAAAAGCAAAG 
C TAAATATGT T TACAGAC CAAAGT GT GAT T TCACACT GT TT T T AAAT C TAGCAT TAT T CAT T TTG 
C TT CAAT CAAAAGTGGTT TCAATATT TT TT TTAGT T GGT TAGAATAC T T T CTTCATAGTCACAT T 
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
AAAAAATATAAAAGCTACCAATCTTTGTACAATTTGTAAATGTTAAGAATTTTTTTTATATCTGT 
TAAAT AAAAAT TAT T TCCAACA 
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FIGURE 122 



MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQPCAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 



App_ID=10063570 



Page 272 of 3 18 



FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
AT T GCCT GGAAGAATACATCAT GTT T T TCGATAAGAAGAAAT T G TAGGAT CCAGT TTTTTTTTTA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCC CCTAAGCCAAAGCAAAAGACC TAAGGACGACC T T TGAACAATACAAAGG ATGG GT TT CAAT G 
TAAT TAGGCTACTGAGCGGAT CAGCT GTAGCAC TGG T TATAGCCCCCACTGTCT TACT GACAAT G 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
ATAACAGCCTTCAAAAACTTAAGTATAATCAATTTAAAGGGCTCAACCAGCTCACCTGGCTATAC 
CTTGACCATAACCATATCAGCAATATTGACGAAAATGCTTTTAATGGAATACGCAGACTCAAAGA 
GCTGAT TCTTAG T TCCAATAGAAT CT CCTAT T T TCT TAACAATACC T TGAGACCTGTGACAAAT T 
TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
AGACTGCCGCAACC T GGAACT T T T GGACC TGGGATATAACCGGATC CGAAGT T TAGCCAGGAAT G 
TCTTTGCTGGCATGATCAGACTCAAAGAACTTCACCTGGAGCACAATCAATTTTCCAAGCTCAAC 
CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
CAT AGGACAGACCATGTCCT GGACC T GGAGC T CCT TACAAAGGCT TGATT TATC AGGC AAT GAGA 
TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
TCCAACAAGCTCACATT TATT GGT CAAGAGAT T T TGGAT T CT T GGATATCCCTCAATGAC ATCAG 
TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
T TAAAGGT CTAAGGGAGAATACAAT TAT C T GTGCCAGTCCCAAAGAGC TGCAAGGAGT AAATGTG 
AT C GAT G CAG T GAAGAAC T ACAGCAT C T G T G G CAAAAGT AC TACAGAGAG GT T T GAT C T GGC CAG 
GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAGTCC CTAAAGCAAATGACT CC CAGCACCCAGGAAT T T T AT GTA 
GAT TATAAACCCACCAACACGGAGAC CAGCGAGATGC TGCT GAATGGGACGGGACCC T GCACCTA 
TAACAAATCGGGCTCCAGGGAGTGTGAGGTATGAACCAT TGTGATAAAAAGAGCTCT TAAAAGCT 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGTCATTT.TCCTCTCATACATAATCAACCCATTGAAATTTAAATACCACAATCAATGTGAAGCTT 
GAACTCCGGT T TAATATAATAC CTAT TGTATAAGACC CT T T AC TGATTCCATTAATGT CGCATT T 
GTTT TAAGATAAAACT T C T TTCATAGGTAAAAAAAAAAA 
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FIGURE 124 



a 



MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEIPSSISAGCLG 
LS LRYNSLQKLKYNQFKGLNQLTWLYLDHNHI SNI DENAFNG IRRLKEL I LS SNRIS YFLNNTFR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
LARNVFAGMI RLKELHLEHNQFSKLNLAL FPRLVS LQNLYLQWNK I S VI GQTMS WT WSS LQRLDL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKS FKGLRENT 1 1 CAS PKELQGVNVIDAVKNYS I CGKSTTERFDLARALPKPT FKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide; 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499/ 504-507 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myr i s toyl a tion s i te s . 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 



App_ID=10063570 



Page 274 



FIGURE 12* 



CCGTTATCGTCTTGCGCTACTGCTG AATG TCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCC T GGATCTCACAAAAACTCGAC TCCAAATGCAAGGAGAAGCAGCT CT TGC TC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
ATT GAAGAGGAAGGCT T TCTAAAGCT TTGGCAAGGAGTGACACCCGCCAT TTACAGACACG TAGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 

W* TTAGCCAATCCAACTGACCTAGTGAAGGTTCAGATGCAAATGGAAGGAAAAAGGAAACTGGAAGG 
AAAACCAT TGCGAT TTCGTGGT GTACAT CATGCAT T T GCAAAAATC T TAGCTGAAGGAGGAATAC 

™e GAGGGCT T TGGGCAGGCTGGG T AC CCAATATACAAAGAGCAGCACTGGTGAATATGGGAGAT TTA 

Fsl * 

i s j ACC ACT T ATGAT AC AG TGAAAC AC TAC T TGG T AT TGAAT AC AC CAC T TGAGGAC AATATC ATGAC 

btsg 

yl TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
% l TCAAAAGCAGAATAATGAAT CAACCACGAGATAAACAAGGAAGGGGACT T T TG TATAAAT CATCG 

ACT GAC TGCT TGAT TCAGGCTGTT CAAGG TGAAGGAT TC AT GAGTC TATATAAAGGCT TT T TACC 
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
TGAGTGGAGTCAGTCCATTTTAA 



III 
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FIGURE 126 



MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHVVYSGGRMVTYEHLREWFGKSEDEHYPLWKS 
VIGGMMAGVI GQFLANPTDLVKVQMQMEGKRKLEGKPLRFRGVHHAFAK I LAEGG IRGLWAGWVP 
NIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 
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FIGURE 127 

CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCC ATGG CCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAATGAAGAGGACATCC TC TC CATGTTC TCAGCTAT CCGTT C TCAGCACAGCGG T G T AGACAT C 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
AGGAGCGGAATGTGGACGATGGGCACATCATTAACATCAATAGCATGTCTGGCCACCGAGTGTTA 
CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
AGACACAATT CGCC T TCAAAC T CCAC GACAAGGACCCTGAGAAGGCAGCTGCCACCTATGAGCAA 
ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
CAT CCAGATTGGAGACATCCAGAT GAGGC CCAC GGAGGAGGTGACC TAGTGACT GT GGGAGCTCC 
TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
CACGGGATACCACT T CCTGTCCACACCCCGACCAGGGGC TAGAAAAT T TGT TTGAGAT TT T TATA 
TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCCTGGCCCAGTGGATT TCATGGTGAT CATTAAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 



MARPGMERWRDRLALVTGASGGIGAAVARALVQQGLKVVGCARTVGNIEELAAECKSAGYPGTLI 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNA^ 

SAYQSMKEROTDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
I S PGWE TQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVI YVLS T PAHI QI GDI QMRPTEQVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 



N-rayristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 



Wf 199-205 

■r. 

Q 

Short-chain alcohol dehyrogenase. 

□ amino acids 30-42, 104-114 

rs 



; sjf. 



App_ID=10063570 



Page 278 of 3 18 



FIGURE 120 

AA.CTTC1AC ATG GGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCT GTAAAAGGCCATGGAACT T TGGGTGAATCAC CGAT GC CATT TAAGAGGGTT T TC T GCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAAATGAGGAACTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
TTAATT T TT AACTGATAGT T G TACATAT T TGGGGG TACATGTGATAT T TGGATACATGTATACAA 
Y*k TATAT AATGATCAAATCAGGGTAACT GGGATATCCAT CACATCAAACAT T TAT T T T T TAT T CT T T 

0 TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAAC 
;gf CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
GCACC AC AATGCCCAACTAAT T TT TGTAT T T T TAG TAGAGACGGGG T TTT GCCATGT T GC CCAGG 
jjj CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
\\ GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
:Wl ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
* ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 

r \ 

X\\ AAC CTCT GATCACC TCAT TCTACT C T CTACC TCCATGAGATCCACT T T T T TAGCTCCCACAT GTG 

m AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 

ftj TTC CATCCATGT TGCTGCAAAT GACAGGAT T TCGT TC T TAAT T T CAAT TAAAATAACCACACATG 

SJ gcaaaaa 
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FIGURE lfto 

MGLLLLVLFLSLLPVAYTIMSLPPS FDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGT LGE S PMP FKRVFCQDGNVRS FCVCAVHFS SHQP PVAVE CLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

s"* N-myristoylation site. 

o 

Ull amino acids 86-92 

L-J 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 




"AH ■} 

y 
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FIGURE 131 



TT CT GAAGT AACGGAAGCTACCT T GT AT AAAGACCTC AAC AC TGCTGACCMGATC AGCGCAGCCTGG AGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAAT ACCAGAGGAT GCT ACAACT CTCTACCTT CAGAACAACC AAATAAAT AAT G CTGGGATT CCTTCAGAT 
TT GAAAAACT TGCT GAAAGT AGAAAG AATATACCTATACCAC AAC AGT TT AGAT GAATTT CCT ACCAACCT 
CCCAAAGT ATGT AAAAGAGT TACATT TGCAAGAAAATAAC AT AAGG ACT AT C ACTT ATGAT T CACTTTCAA 
AZiATTCCCTATCTGGAAGAATTACATTTAGATGACAACTCTGTCTCTGCAGTTAGCATAGAAGAGGGAGCA 
TT CC GAGACAGCAACT AT CT CCGACTGCTTTT CCT GTCCCGT AAT C ACCTTAGCAC AATTCCCT GGGGTTT 
2* GCCC AGGACT AT AGAAGAACTACGCTTGGATGAT AATCGCAT AT CC ACTATTTC ATC ACCAT CT CTT C AAG 

'Zl GT CT C ACT AGT C TAAAACGCCT GGTTCT AG AT GGAAACCTGT TGAAC AATCATGGTTT AGGT GAC AAAGT T 

m 

ffit TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 

\\\ TCCAGGC ACAAACC TG AGGAAGCT TT ATCT TCAAGATAACCACATCAATCGGGT GCCCCC AAAT GCT T TTT 

If] CT TATCTAAGGC AGCTCT AT CGAC TGGAT ATGTCC AAT AATAAC CT AAGT AATT T ACCTC AGGGT AT C TTT 

S\ GATGATTTGGAC AAT AT AAC AC AACTGATT CT TCGC AACAAT CC CTGGT ATTGC GGGTGC AAGAT GAAAT G 

■Q GGTACGTGACTGGTTACAATCACTACCTGTGAAGGTCAACGTGCGTGGGCTCATGTGCCAAGCCCCAGAAA 

■ H AGGTTCGTGGGATGGCTATTAAGGATCTCAATGCAGAACTGTTTGATTGTAAGGACAGTGGGATTGTAAGC 

WJ ACCATT C AGAT AACC ACTGCAAT ACCCAAC AC AGTGTATCCT GCCCAAGG ACAGTG GCC AGC TCC AGTGAC 

! H 

* r ; CAAAC AGCCAGAT AT T AAGAAC CCC AAGCTCACTAAGGATCAACAAACCACAGGGAGTCCCT CAAG AAAAA 

2* CAAT T AC AAT TACT GTG AAG TCTGTC ACCTCT GAT ACCATTCATATCTCT TGGAAACTTGCT CT ACCT ATG 

;L':5 ACTGCTTTGAGACTCAGCTGGCTTAAACTGGGCCATAGCCCGGCATTTGGATCTATAACAGAAACAATTGT 

*g s AACAGGGG AACGCAGTGAGT ACT TGGTC ACAGCC CTGGAGCCT GAT TCACCC TAT AAAGT AT GCATGGTTC 

; If 

CC ATGG AAAC CAGC AACCT C TACC T ATTTGAT GAAACTCCTGTT TGTATTGAGACTGAAACT GCACC CCTT 
CGAATGTACAAC CC T ACAAC CACCCTCAATCGAGAGCAAG AGAAAG AACCTT ACAAAAACCCCAATT TACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
T T CAT AGG AAT GGAT CGCTCT T CT C AAGGAACTGT GCAT ATAGCAAAGGGAGGAGAAGAAAGGATG AC TAT 
GC AGAAGCTGGC ACT AAG AAGGAC AACTCT AT CCT GGAAAT C AGGG AAACTT CT TTTC AGAT GT T AC CAAT 
AAGCAATGAACCCATCTCGAAGGAGGAGTTTGTAATACACACCATATTTCCTCCTAATGGAATGAATCTGT 
ACAAAAACAATCACAGTGAAAGCAGTAGTAACCGAAGCTACAGAGACAGTGGTATTCCAGACTCAGATCAC 
TC ACACTC ATGA TG CTGAAGGACT CACAGCAGACT T GTGTTT TGGGTT TTTT AAACCT AAGGGAGGTG ATG 
GT 
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FIGURE 132 

MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKTVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNSVSAVSIEEGAFRDSNYLRLLFLSRNHLSTIPWGLPRTIEELRLDDNRISTISSPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGI FDDLDNITQLI LRNNPWYCGCKMKWVRDWLQSLPV 
KVNVRGLMCQAPEKVRGMAIKDLNAELFDCKDSGIVSTIQITTAIPNTVYPAQGQWPAPV 
IKNPKLTKDQQTTGS PSRKT I T I TVKS VTSDT IHI SWKLALPMTALRLSWLKLGHSPAFGS ITET 
fab IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
O PYKNPNLPLAAI I GGAVALVT IALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS 

ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

Important features of the protein: 
Signal peptide: 
amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N-glycosylation sites ♦ 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-57 0 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 



If! 
H 

t ss 

i S 5 
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FIGURE 133 



CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCAGCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
M TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGG 

o 

Zl GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 

y % 

I CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
jjj GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
GACCACTCCAACAGAGCCAGGACTATATCAACC TCTTCTGCGCCAACATGATGGACTT GAACCGC 
^ AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
[jj TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
Q AACAGCAGGAAGGATGCTTCGGGGAGCCTGATGCTGAAGATGAAGAATTATCTAAAGCTATTCAA 
fll TAT CAGCAGCATT T T TCGAGGAGAG T GAAGAGGCGAGAAAAACAAT T TCCAGAT TCTCGCT CTGT 

y TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGC TGGGAC TAGAGGAGCGTGCGACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCT GAAAAGCACT T T CAAAGAGACTGTGT TGAATAAAGGGCCAAGG T TC T TGCCACCCAGCAC T C 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
G TGGCT T CC TATACATCC TGGC AGAATACCCCCCAGCAAACAGAGAGCCACACCCAT CCACACCG 
CCACCACCAAGCAGCCGC TGAGAC GGACGGT TCCATGCCAGC T GCC TGGAGGAGGAACAGACGCC 
TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA 
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FIGURE 134 

MSARGRWEGGGRRACRGSLGLARAQGAERVTS SEQRPAMASLGLLLLLLLTALPPLWSS SLPGLD 
TAESK^TIADLILSALERATVFLEQRLPEINLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDAS.LVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCANMMDLNRRAEAIGYAYPTRDI FMENIMFCGMGGFSDFYKLRWLEAI LSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1. 

amino acids 230-237 
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FIGURE 135 



GG T C T GAG T G CAGAGC T G C T G T CATGGCGGCCGCT C T G T GGGGC T T C T T T C CC GT C C T GC T GC T G 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGC T T ACAGAT T TGATCCCGTT CGAGTGGATATCACTTC GAAAGGAAAAAT GAGAGCAAGATATG 
TGAATTACATCAAAACATCAGAGGTTGTCAGACTGCCCTATCCTCTCCAAATGAAATCTTCAGGT 
CCACC T T C TTAC TT TAT TAAAAGGGAATCGTGGGGC TGGAC AGACT T TCTAATGAACC CAATGGT 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
CTGACAT GAGACGGGAAATGGAGCAGTCAATGAATATGCTGAAT TCCAACCATGAGTT GCCTGAT 
GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG 



Lf 1 ' TAAAACAGGCAAAAGTGGGGCT GGCAAAAGGAGGTAGTCAGGCCGTCCAGAGCT GGCATTTGCAC 

k 4 AAACACGGCAACACTGGGTGGCAT CCAAG TC T TGGAAAACC G T GTGAAGCAAC TACTATAAAC T T 

GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
GAGG TCAATAT TGATGT CACTGAAT T AAT TACAGT GT CCTATAGAAAATGCCAT TAAT AAAT TAT 



Q ATGAACTACTATACATTAT GTATAT TAAT TAAAACAT CT TAAT CCAGAAAT CAAAAAAAAAAAAA 



rSI AAAAAAAAAAAAAAA 
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FIGURE 1*6 



MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGEEHVGFLKTDGS FWHDI PSGS YWEW 
VVRLPYPLQMKSSGPPSYFIKRESWGWTDFIJ^INPMVM^^ 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 

Important features of the protein: 
Signal sequence: 

amino acids 1-23 

Transmembrane domain: 

amino acids 161-182 

N-glycosylation site. 

amino acids 184-187 

Glycosaminoglycan attachment sites. 

amino acids 37-40, 236-239 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 151-154 

. N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 137 



GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CT T ACC T GCTGGGCAC TAAC GGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCAC GACC TGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
AAGCTGTGTGATCGCCACAAACCTTCAGGAAATACGAAATGGATTTTCTGAGATACGGGGCAGTG 
TGCAAGCCAAAGATGGAAACAT TGACATCAGAATC T T AAGGAGGAC TGAG TCT T T GCAAGACACA 
AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 
CCATCAAGAAGGACC TCCGGC T CT CT CATGCCCACATGACATGCCAT TGTGGGGAGGAAGCAATG 
AAGAAATACAGCCAGAT TCTGAGT CACT TTGAAAAGCTGGAAC C TCAGGCAGCAG T TGTGAAGGC 
TTTGGGGGAACTAGACATTCT TCTGCAATGGATGGAGGAGACAGAATAGGAGGAAAGT GATGCTG 
C TGCTAAGAATAT T CGAGGTCAAGAGCT CCAGTC T TCAATACC TGCAGAGGAGGCATGAC CCCAA 
ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
CTTCCTTGCATGATTGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTT 
TGTAATATCTTTCTGCTAT TGGATATAT TTAT TAGTTAATATAT TTAT TTATTT T T TGCTAT TTA 
ATGTATTTATTTTTT TACT TGGACATGAAACTTTAAAAAAATTCACAGAT TATATT TATAACCTG 
ACTAGAGCAGGTGATGTAT TT TTATACAGTAAAAAAAAAAAACCT TGTAAATTC TAGAAGAGTGG 
C TAGGGGGGT TAT TCAT T TGTATT CAAC TAAGGACAT AT T T ACTCAT GCT GAT GCT CT GT GAGAT 
AT T TGAAATTGAACCAATGACTAC T TAGGAT GGGT TGTGGAAT AAG T T T TG2VT GTGGAAT TGCAC 
ATCTACCT TACAAT TACTGACCAT CCCCAGTAGACTCCC CAGT CCCATAAT TG TGTAT CT TCCAG 
CCAGGAATCCTACACGGCCAGCAT GTAT T TCTACAAATAAAGT TT T CTT TGCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 138 

MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 
SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 
FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 
SLAKSFLTIKKDLRLSHAHMTCHCGEEAMKKYSQILSHFEKLEPQAAWKALGELDILLQWMEET 
E 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-myristoylation sites. 

amino acids 42-47, 46-51, 136-141 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACG ATGC GGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GG T CAAAGGGTGTGAAAT T TAT GC CC TACACAACT TACC T T G T GGAAAAAGGAGCATCTCACAGT 
T T T G T GGC TGAAGCAAAGCCACCT ACAGT TACTATGCCT CGAAT CAAGGCAT T GAT GACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGCTGGAAAAAGAATAG TC T T T T ATGGAGATGAAACCTGGG T TAAATTA 
TTCCCAAAGCATTTTGTGGAATATGATGGAACAACCTCATTTTTCGTGTCAGATTACACAGAGGT 
GGATAAT AATGTCACGAGGCATT TGGATAAAGTATTAAAAAGAGGAGATT GGGACATAT TAATCC 
TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
AAGCTGAGCGAGATGGACAGCGTGCTGATGAAGATCCACACCTCACTGCAGTCGAAGGAGAGAGA 
GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
CCCGGTGATATCCGACATCCAAAGCACGTCCA ATAGA CGGATGTGGCTGCGACACTGGCGATAGC 
ACT TGGC T T ACCGATTCCAAAAGACAGTG TAGGGAGC CT CCTAT TCCC AGT TG T GGAAGGAAGAC 
CAATGAGAGAGCAG T TGAGATT TT TACAT T TGAATACAGTGCAGCT TAGTAAACTGT T GCAAGAG 
AATGTGCCGTCATATGAAAAAGATCCTGGGTTTGAGCAGTTTAAAATGTCAGAAAGATTGCATGG 
GAAC T GGAT CAGAC T G TAC T T GGAG GAAAAGCAT T CAGAAG T C C TAT T CAACC T G GG C TC CAAGG 
TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
* AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
j TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA 
CACT CAGGAGGG TCAAAAGGAGACT T GGTCGCACCACTCAT CC TGCCACCCCCAGAAT GCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACTCAGATCCACAGAGCCCAGGATCAAGGGACCCACTGCAGTGGCAGCAGGACTGTTGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
CTGACCGAGACACTCACAGCTTTGTCATCAGGGCACAGGCTTCCTCGGAGCCAGGATGATCTGTG 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
C TGCACACAGTATGTAGT TACCAAAAGAATAAACGGCAATAAT TGAGAAAAAAAA 
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FIGURE mo 



MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKWIVL I DALRDDFVFGSKGVKFMP YT T YLVEKGASHS FVAE AKP PTVTMPRI KALMTGS LPGF 
VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVL CGDHGMS E TGSHGAS S T EE VNT PL I L I S S AFERKPGD I RHPKHVQ 

Important features of the protein: 
Signal peptide: 



amino acids 1-34 



Transmembrane domain: 

amino acids 58-76 

N-glycosylation sites. 

amino acids 56-60, 194-198 

N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Amidation site* 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 



AppJD=10063570 



Page 290 of 318 



FIGURE 141 

GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
TCCCTATAGAAAACAACTGCCAGCACCTTAAGACCACTCACACCTTCAGAGTGAAGAACTTAAAC 
CCGAAGAAATTCAGCATTCATGACCAGGATCACAAAGTACTGGTCCTGGACTCTGGGAATCTCAT 
AGCAGT T CCAGATAAAAACT ACAT ACGCCCAGAGATCT T CTT T GCAT TAGCCTCATCCT T GAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TGTGACAAGGATAAAGGACAAAGTCATCCATCCCTTCAGCTGAAGAAGGAGAAACTGATGAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GTT GGGGTGACAGATAAATTTGAGAACAGGAAACACATT GAAT TTTCATT TCAACCAGTTTGCAA 
AGCTGAAAT GAGCCCCAGT GAGGT CAGCGAT TAGGAAACTGCCCCAT TGAACGC CTTC CT CGCTA 
ATTTGAACTAATTGTATAAAAACACCAAACCTGCTCACT 
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FIGURE 142 

MLLLLLEYNFPIENNCQHLKTTHTFRVKNLNPKKFSIHDQDHKVLVLDSGNLIAVPDKNYIRPEI 
FFALAS S LSSASAEKGS P I LLGVSKGE FCLYCDKDKGQSHPSLQLKKEKLMKLAAQKESAE^RPFI 
FYRAQVGSWNMLESiy^PGWFICTSCNCNEPVGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-myristoylation site. 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 



AppJD=10063570 



Page 292 



FIGURE 143 

CTAGAGAGTATAGGGCAGAAGGAT GGCAGATGAGTGAC T CCACATCCAGAGC TGCCTCCC T TTAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCGTGTTGCCAGTGTGGGGTGAGACT^AGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGTGGAGT T T GCT GTCCACACAT TCAACCAACAGAGCAAGGACTACTAT GCCTACAGAC TGGGG 
CACAT CT TGAATT CCTGGAAGGAGCAGGTGGAG TC CAAGACT GTAT TC TCAAT GGAGC TACTGCT 
GGGGAGAACTAGGTGTGGGA^TTTGAAGACGACATTGACAACTGCCATTTCCAAGAAAGCACAG 
AGCTGAACAATACT TTCACCTGCT TCTTCACCATCAGCACCAGGCCCTGGATGACT CAGT TCAGC 
C TC C T GAACAAGACCTGC TT GGAGGGAT TCCAC TGAGTGAAAC CCAC TCACAGGCT TGTCCATGT 
GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
ACACATCT TGAGCCTAATCATGTAGT GTAGATCATTAAACATCAGCATT TTAAGAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE ld4 

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHNVMARYLPATVSFAVHTFNQQSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites • 

amino acids 117-121, 139-143 

N-rnyristoylation site. 

amino acids 9-15 
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FIGURE 145 

CTGTGCAGCTCGAGGCTCCAGAGGCACAC'i'CCAGAGAGAGCCAAGGT'i'CTGACGCG ATGA GGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAAT CAAGT GGAACCGGAAGGCCCTGCCCAGCAC TGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACT TCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACT GGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGC GGCGAACCAGGGGGAGT TC CAGAAGCCAGACAACAAGCT CCAC CAGCAGG T GC TCTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
C TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
Q AAGC T TGCCAGGAGGC TGGCAGTACAGAGCGCAGCAGCGAGCAAAT CCTGGCAAGT GACCCAGCT 

0! CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
W TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTA7\ATGCTTGAT 

# H GAGAAGAACACATCAGGCACTGCGCCACC TGCT TCACAGTACTTCCCAACAACT CT TAGAGGTAG 

few* 

H GTGTAT T CCCGTT TTACAGATAAGGAAAC TGAGGCCCAGAGAGC TGAAGTAC TGCACCCAGCAT C 

Q ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGGTTGTCTAACCCCAGGTTTTCTGCTCT 
GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 
C TC TAAT gaaat tg T GAAAGCT ccat GT t TAGAAATAAATGAAAACACCTGA 
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FIGURE 146 



MRKHLSWWWIATVCMLLFSHLSAVQTRGIKHRIKWNRKALPSTAQITEAQVAENRPGAFIKQGRK 
LD I D FGAEGNR YYEANYWQ FPDG I HYNGC S EANVTKEAFVT GC I NATQAANQGE FQKP DNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites. 

amino acids 98-102, ■ 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 76-83 

N-myristoylation sites . 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 
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FIGURE 147 

GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
C AAC T T T CAGAT TCAGGGGGTACATGTGAAGGT T TG T T T TATGAGTATATT GCATGAT GCT GAGG 
TTTGGGGT 



Q 



rij 
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FIGURE 148 

MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 



Important features of the protein: 
Signal peptide: 

amino acids 1-25 



N-myristoylation site. 

amino acids 62-68 

5 
m 

UJ. 
Ui 

Q 

a 
m 
a 

111 

a 
ru 
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FIGURE 14Q 

GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAAT CTCTACATAATAGAGAATTAAGTGCAGAAAGACCTTT GAAT GAACAGATTGCTGAA 
GCAGAAGAAGACAAGATTAAAAAAACATATCCTCCAGAAAACAAGCCAGGTCAGAGCAACTATTCTTT 
TGTTGATAACTTGAACCTGCTAAAGGCAATAACAGAAAAGGAAAAAATTGAGAAAGAAAGACAATCTA 
TAAGAAGCT CCCCACTT GATAATAAGT TGAATGTGGAAGATGTTGATT CAACCAAGAAT CGAAAACTG 
ATCGATGATTAT GACTCTACTAAGAGTGGATTGGAT CATAAATTTCAAGATGATCCAGATGGTCTTCA 
T CAACTAGACG G GACT C CT T TAAC CGC T GAAGACAT TGT C CATAAAAT C GCT G CC AG GAT T TAT GAAG 
AAAATGACAGAGCCGTGTTTGACAAGATTGTTTCTAAACTACTTAATCTCGGCCTTATCACAGAAAGC 
CAAGCACATACACTGGAAGAT GAAGTAGCAGAGGT T TTACAAAAATTAATCTCAAAGGAAGCCAACAA 
TTATGAGGAGGATCCCAATAAGCCCACAAGCTGGACTGAGAATCAGGCTGGAAAAATACCAGAGAAAG 
TGACTCCAAT GG CAG CAAT T C AAGAT GGT CT T GCTAAGG GAGAAAACGAT GAAAC AGTAT C TAACAC A 
TTAACCTT GACAAATGGCTT GGAAAGGAGAACTAAAACCTAGAGT GAAGACAACTTTGAGGAACTCCA 
ATAT TTC C CAAATTTCTATGC GCTACT GAAAAGTAT T GAT TCAGAAAAAGAAGCAAAAGAGAAAGAAA 
CACTGATTACTATCATGAAAACACTGATTGACTTTGTGAAGATGATGGTGAAATATGGAACAATATCT 
CCAGAAGAAGGTGTTTCCTACCTTGAAAACTTGGATGAAATGATTGCTCTTCAGACCAAAAACAAGCT 
AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA 
CAG AC AG T ACC AAGG AAGAAG CAG C T AAGAT GGAAAAGGAATATG GAAG CT T GAAG GAT T C CACAAAA 
GATGATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTATTTGGAAGC 
CATCAGAAAAAATATTGAATGGTTGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTATGACCTTT 
CAAAGATGAGAGACTTCATCAATAAACAAGCTGATGCTTATGTGGAGAAAGGCATCCTTGACAAGGAA 
GAAGCCGAGGC C ATC AAGCGCAT T T AT AGCAGC C T GTAAA AAT GGCAAAAGAT CCAG GAGT CT T T CAA 
CTGTTTCAGAAAACATAATATAGCTTAAAACACTTCTAATTCTGTGATTAAAATTTTTTGACCCAAGG 
GTTATTAGAAAGTGCTGAAT T TACAGTAGTTAACCT T TTACAAGT GGT TAAAACATAGCTTTCTTCCC 
GTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAAAAAAAAAAAAAAAAA 
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FIGURE l^o 

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYSFVDNLNLLKAITEKEKIEKERQSIRSSPLDNKLNVEDVDSTKNRKLIDDYDSTKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKTYSEDNFEELQYFPNFYALLKS IDSEKEAKEKETL I T IMKTL I DFVKMMVKYGT I SPEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 
220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-rayristoylation sites : 

amino acids 143-148, 239-244 
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FIGURE isi 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
G ATGG TCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAATAACCAGCT TCTAGCTGGAGGGCTGCATGCAGGGAAGGTCAT TAAAGGTGAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 



TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
m CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
%\ GGGTCAGTAGCTC TCCACAT GAAGTCC T G TCAC TCACCAC TG T GCAGGAGAGGGAGGT GGTCATA 

21 GAGTCAGGGATCTATGGCCCTTGGCCCAGCC.CCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
WJ TGCTAGCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
LlJ GAAATGGCT CGAGC TCAGAAGATAAAAGATAAG TAGGGTATGC TGATCC TC T T T TAAAAACCCAA 

ifl GAT ACAATCAAAAT CCGAGATGCT GGTCT CTAT TCCCATGAAAAAGT GCTCATGACATAT T GAGA 



C | AGACCTAC T TACAAAG TGGCAT ATAT TGCAAT T TAT T T TAAT TAAAAGATACCTAT T TATATAT T 

^ TCT T T AT AGAAAAAAGTC TGGAAGAGT TTAC T TCAATT GTAGCAAT GTCAGGGT GGTGGCAGTAT 

kdj AGGTGAT TTTTCTTT TAAT TCT GT TAAT T TAT C TGTAT TT CC TAAT T TT T CTAC AATGAAGAT GA 

P AT T CCT T GT ATAAAAATAAGAAAAGAAAT TAATCT TGAGGT AAGCAGAGCAGACAT CATCTCT GA 

p TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
f« TTGTAG T AGT GAT CAGGAAACAGAT CTCAGCAAAGCCACT GAGGAGGAGGCTGT GG TGAGTT TGT 

%l GTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAAGAAAAATCATCTGGTAATTCTTTCCTAGAAG 
GAT CACAGC C CC T GGGAT T C CAAGG CAT TGGAT CC AGT C T C T AAGAAGGC T GC T G T AC T GG T TGA 
flj AT T GTGT CCCCCTCAAAT TCACAT CCT TC T T GGAATCT CAGT C TGT GAGT T TAT T T GGAGATAAG 

Q GTCTCTGCAGATGTAGTTAGT TAAGACAAGG TCATGCTGGATGAAGGTAGACCTAAATTCAATAT 

ffj GACTGG T T TCCT T G TATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGACTATGTA 

■ ** AAGATGAAGGCAGAGATCGGAGT T T T GCAGCCACAAGC TAAGAAACACCAAGGATT GT GGCAACC 

ATCAGAAGCT TGGAAGAGGCAAAGAAGAATTCT TCCCTAGAGGCTTTAGAGGGATAACGGCTCTG 
CTGAAAC C T TAAT C TCAGACT T CCAGCCTCC T GAACGAAGAAAGAATAAAT T T CGGCT GT T T TAA 
GCCACCAAGGATAAT TGG TTACAGCAGCTCTAGGAAACTAATACAGC TGC TAAAAT GATC CCTGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATATGGCAGAAGTGAT GGCAT GCCACT T CCAAGAT TAGGT TATAAAAGACAC TGCAGC T TC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTT-GGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 
GCCCACAGCCACAT TAGT GAAC CTAGAAGCAGAGACTC T G T GAGATAATCGATGT TTGTT GT TTT 
AAGT T GCTCAGT T T TGGT CT AACT T GTTATGCAGCAATAGATAAATAATATGCAGAGAAAGAG 
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FIGURE 152 

MVLSGALC FRMKDS ALKVL YLHNNQL IjAGGLHAGKVI KGEE I SWPNRWLDAS LS PVI LGVQGGS 
QCLSCGVGQEPTLTLEPWIMELYLGAKESKS FTFYRRDMGLTSS FESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 

N-rrryristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-63, 73-78, 91-96, 106-111 

Interleukin-1 signature . 

amino acids 111-131 

Interleukin-1 proteins . 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE isa 



CT TCAGAACAGGT TCTCCTT CCCCAGTCACCAGT TGCT CGAG T TAGAAT 1'GTC T GCA ATG GCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TAT TGAAGGTGATGACC TGCAT AT CC AGAGGAAT G TGCAAAAGCTGAAGGACACAGTGAAAAAGC 
TTGGAGAGAGTGGAGAGATCAAAGCAAT TGGAGAAC T GGAT T T GCTGTT TATG T CT CTGAGAAAT 
GCCT GCAT T TGACCAGAGCAAAGCTGAAAAAT GAATAAC TAACCCCC TT TCCC T GCTAGAAATAA 
CAAT TAGATGCCCCAAAGCGAT TT TTTT TAACCAAAAGGAAGATGGGAAGCCAAACTCCATCATG 
ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 
AGACCAGAAGGTAGAC TT TCTAAGCATAGATAT T TAT TGAT AACAT T TCATT GT AACTGGT GT TC 
TATACACAGAAAACAATT TATTTT TTAAATAAT TGTCTT TTTCCATAAAAAAGATT ACTT TCCAT 
TCCT TTAGGGGAAAAAAC CCCTAAAT AGC T TCATGT TT C CATAATCAGTACT TTAT AT T T ATAAA 
TGTATT TATTATTATTATAAGACTGCAT TTTATT TATATCATT TTATTAATATGGAT TTATTTAT 
AGAAACATCATTCGATAT TGCT ACT TGAGTG TAAGGCTAAT AT TGATAT T TATGACAATAATTAT 
AGAGCTAT AACAT GTT TAT T TGAC CTCAATAAACACTTGGATATCCC 
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FIGURE 1ZA 

MAALQKSVS S FLMGTLAT S CLLLLALLVQGGAAAP I S SHCRLDKSNFQQP Y I TNRT FMLAKEAS L 
ADNNTDVRLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 

N-myristoylation sites . 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE 155 

GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCCTGAGGGC TGT GCTGAGAGGGAGAGGAGCAGAGAT GCT GC TGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTGATTAGCCTTTTCCTACAGGTGGTTGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
GCT GCTC TACCACAACCAGAC TGTC T TC TACAGGC GGCCAT GC CATGGCGAGAAGGGC AC CCACA 
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 
CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
GCCCAGCACAGGCACTTTCTAGATATTTCCCCCTTGCTGGAGAAGAAAGAGCCCCTGGTTTTATT 
TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
TTGACAT GGAT GAT TC TGAGGAGGAAGCTGTTAT T GAAT GTATAGAGAT T TATC CAAATAAATAT 
CT T TAT T TAAAAATGAAAAA 
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FIGURE is6 

MRERPRLGEDSSLISLFLQWAFIAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 

amino acids 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myristoylation sites . 

amino acids 44-50, 150-156 
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FIGURE 157 

CCGGCG ATGT CGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGACT TGAGGGACC TCCGAGTAGAACC TG T TACAACTAGTGT TGCAACAGGGGACTAT TCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GAT T TGT GTGACGGGCAAAAGCAACT TCCAGT CCTACAGCT G T G TGAGGTGCAAT T ACACAGAGG 
CCTTCCAGAGTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGCTGAACACAGTCTATTTCATTGGGGCCCATAATATTCCTAATGCAAATATGAATGAAGATGG 
CCC TTCCATGTCTGT GAAT T T CAC C T CACCAGGCTGC C TAGAC CACATAATGAAAT AT AAAAAAA 
M AGT GTGT CAAGGCCGGAAGCCT GT GGGATCCGAACAT CACT GC T TG TAAGAAGAAT GAGGAGACA 

W GTAGAAGTGAACT TCACAACCACT CCCCTGGGAAACAGATACATGGCTCT TATCCAACACAGCAC 

TATCATCGGGTTTTCTCAGGTGTTTGAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGA 
fH TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
jfl AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
%i GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
y CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 
* v . TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
TCACACAAT T TGT TACT TCACT GAAT TTCT T CAAAACCAT TGC AGAAG T GAGGT CATCCT TGAAA 

m 

£J AGT GGCAGAAAAAGAAAATAGCAGAGATGGGTCCAGTGCAGTGGC T T GCCAC TCAAAAGAAGGCA 

f|| GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
ft! GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
Hl GTGATCTAAGAAGCCAGAT TCATCTGCACAAATACGTGGTGGT CTACTT TAGAGAGATTGATACA 

AAAGACGATTACAATGCTCTCAGTGTCTGCCCCASiGTACCACCTCATGAAGGATGCCACTGCTTT 
C T GT GCAGAACT TC TCCATGTCAAGCAGCAGGT GTCAGCAGGAAA&AGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE 1*8 



MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSWV 

LRADASIRIJiKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTrSYIGFPVELlSITVYFIGAHNIP 

NA3SIMNEDGPSMSVNFTSPGCLDHIMKYKKKCVKAGSLWDPNITACKKNEETVEVNFTTTPLGNR 

STIIGFSQVFEPHQKKQTRASWIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTVVLCPQTGVPFPLDNNK 

SKPGGWLPLLLLSLLVATWLVAGIYLMWRHERIKKTSFSTTTLLPPIK7LVVYPSEICFHHTICYFTEFL 

QNHCRSEVILEKWQKKKIAEMGPVQWLATQKKAADKWFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 

FNLFCSDLRSQIHLHKYVWYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 

GCCSL 



m 

Ins :■ 



rii 
Ci 



Important features of the protein: 
Signal peptide: 

amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

N-glycosylation sites. 

amino acids 67 - 71, 103 - 107, 156 - 160 r 183 - 187, 197 - 201 and 283 
- 287 

cAMP- and cGMP- dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 



Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 



N-myristoylation site. 

amino acids 116-122 



Amidation site. 

amino acids 488-452 
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FIGURE 15Q 



AGCCACCAGCGCAAC ATGA CAGTGAAGACCCTGCATGGCCCAGCCATGGTCAAGTACTTGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
C TT T T T T CCAAAAGCC TGAGAGTT GC CCGCC T G TGCCAGGAGGTAG TAT GAAGC T TGACAT TGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACT TGGGCT GCATCAATGCT CAAGGAAAGGAAGACATCT CCAT GAAT TCCGT T C CCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 
, ( ACT CAGC TGAAGAAG 

rl 

n 



''•zss 

m 
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FIGURE 160 

MTVKTLHGPJMVKYLLLSIL^ 

QRVSMSRNIESRSTS PWNYTVT WDPNRYPS EWQAQCRNLGCINAQGKED I SMNS VP I QQE TL W 
RRKHQGCSVS FQLEKVLVTVGCTCVTPVIHHVQ 

Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-myristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 



ACACTGGCCAAACAAAAACGAAAGCACTCCGTGCTGGAAGTAGGAGGAGAGTCAGGACTCCCAGG 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCT GAAGATGAGGAAAAGT T TGGAGGAGCAGCT GAC T CAGGGGTGGAGGAGCCTAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
u GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
%l ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
Q CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
m CGAACATCTGCGCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
fjj CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
% GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
^\ TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
%l AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
Q CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
Z" G TGGCTGTACT TCAC TACCCAGCAAAGCC TCCACGAGGGCAGCTCGCC T T GGAGAGTACT TAC TA 

(L CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
y CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
if! CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
Q CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
fif CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
^ CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
W CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
ffj GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACT TAAATAAAGGCAGACGCTGT T T T TC TAAAAAAA 
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FIGURE 162 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQE^TDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQVVL S FQAYPTARCVLLEV 
QVPAALVQFGQSVGSWYDC FEAALGSE VR IWS YTQ PRYEECELNHTQQLPALPWLNVSADGDNVHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRIiTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQ 
VNS S EKLQLQECLWADSLGPLKDDVLLLETRGPQ DNRSLCALE P SGCT SL PSKASTRAARLGE YLLQDLQS 
GQCLQLW DDDLGAL WACPMDKY I H KRWAL VWL AC LL FAAAL S LI LLL KKD HAKG WL RLLKQD VRSG AAARG 
RAALLLYSADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQGPVAWFHAQRRQTLQEGGVWLLFSP 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSY FHPPGTPAPGRGVGPGAGPGAGDGT 

Signal sequence: 

amino acids 1-20 

Transmembrane domain. 

amino acids 453-475 

N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-586 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 165* 

GGGAGGGCTCTGTGCCAGCCCCGA^GGACGCTGCTGACCATCITGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACATC CT GACGTGGGACAGCGGGCCAGAGGGCACCCCAGACACGGT CTACAGCATC GAG TATA 
AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
GTCAGTGCGGGAGGCCGG TCAGCCACCAAGATGACTGACAGGT TCAGC TC TC TGCAGCACAC TAC 
CC T CAAGCCACC TGATGTGACCTGTATCTCCAAAGTGAGAT CGAT TCAGATGAT T GT T CAT CC TA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 
T TC TACCACT TAGAGC T CCAGGTCAACCGCACC TACCAAAT GCACCT T GGAGGGAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 
TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
H CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
O TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 
f 5 l AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 
% TCCACAGCGGCATAGCCTGTCCGAGATCACCTACTTAGGGCAGCCAGACATCTCCATCCTCCAGC 
°f\ CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 
Ui GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 
y! GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 
hj CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 
£\ CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 
hH TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 
s CCCT GGGGAT T TGCACAGACAGAACATCTGACCCAAATGTGCTACACAGTGGGGAGGAAGGGACA 

Q CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 
ill CCT.CCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 
%i TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
J:!! GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 
ill AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 
CJ ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 
ffj AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 
AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAATGACAGTGCAAGGAGGAAATGCAGGGAAACTCCCGAGGTCCAGAGCCCCACCTC 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 
GCCTGGAAA^GAACCAGGCCTGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCTGC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
TTCCCAGCCAGGGCAACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 
GAAATGCAGGTCCACCAGGGAGGGAGACACACAAGCCT T T TC T GCAGGCAGGAGT TTCAGACCC T 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 
CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 
ATCTGTAATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG 

TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 
AAAAAAAA 
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FIGURE i6a 



in 



m 



MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQWRTYQMHLGGKQREYEFFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNS LNVQRVLT FQPLRF I QEHVLI PVFDL S GPS S LAQPVQ YSQ I RVS G PRE PAGAPQRH S L S E I T 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGESGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
S PS DQG PS PWGL LE S L VC PKDEAKS P APE T S DLE QPT ELD S L FRGL AL T VQWE S 



rp Signal sequence. 

l|| amino acids 1-17 



Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites ♦ 

amino acids 80-83, 87-90, 172-175 

N~myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 165 

TGGCCTACTGGAAAAAAAAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAACATGG 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGTGACGATGTATAT GT C T AC TACAAAC TGGCAGGGGGATCCC TT GAAC TT TGGGCTGGA 
AGT G T TGAACACAGT TT TGGAT AT T T TCCAAAAGAT T TGAT CAAGGTACT TCATAAATACACGGA 
AGAAGAGC TACATAT T CCAGCAGATGAGACAGAC T TT G T CT GCT T TGAAGGAGGAAGAGATGAT T 
TTAATAGT TATAATGTAGAAGAGCTT TTAGGATCTTT GGAACTGGAGGACTCTGTACCTGAAGAG 
T CGAAGAAAGC T GAAGAAG T T T C T CAGCACAGAGAGAAAT C TC C T GAG GAG T C T CGGGG GCGT GA 
ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 
CAGAGAGCACCGAGGGGCTGCAGGGACAGCCCTCAGCTCAGGAGAGCCACCCTCACACCAGCGGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
AGACAGATGCT TACAAAGTCCTGAAAACAGAAATGAGTCAGAGAGGAAGTGGACAGTGCGTTAT T 
CAT T ACAG C AAAGGAT TTCGTTGG CAT C AAAAT C TAAG T T T G T T T T AC AAAGAT T GT T T T TAG T A 
C TAAGCTGCCT TGGCAGT T T GCAT T T T TGAGCCAAACAAAA&TATAT TAT T T TCCCT T C T AAGTA 
AAAAAAAAAAAAAAAAAAAA 
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FIGURE 166 



lyTAAAPGLLFWLFVLGALWWPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKALEDFTGPDCRFVN 
FKKGDDVYWYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF 

Important features of" the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site. 

amino acids 294-298 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 

amino acids 67-76 

N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 

Amidation site. 

amino acids 28-32 



AppJD=10063570 



Page 316 of 318 



FIGURE 167 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
7VACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCT.GCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAG'CCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTGAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCTTGAGAATGAAGCAGAACAGAAGTTAATAGAGAGCATGTTGCAAAACCTGACAAAACCCGGG 
j«& ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG 
Q TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
Q ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
yl CTTGGGGGTCCCTACCTTTACCAGTGGAATGATGACAGGTGTAACATGAAGCACAATTATATTTG 
III C AAG TAT G AAC C AGAGAT T AAT C GAAC AGC C C C T G T AGAAAAG C C T TAT C T T AC AAAT C AAC C AG 
Lfl GAGACACCCATCAGAATGTGGTTGTTACTGAAGCAGGTATAATTCCCAATCTAATTTATGTTGTT 
'%l ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 

□ GCATAAAAGTAAAGGAAGAACAAAAACTAGTCCAAACCAGTCTACACTGTGGATTTCAAAGAGTA 
% CCAGAAAAGAAAGTQGCATGGAAGTATAATAACTCATTGACTTGGTTCCAGAATTTTGTAATTCT 

□ GGATCTGTATAAGGAATGGCATCAGAACAATAGCTTGGAATGGCTTGAAA 

Iff AAGATGAACTGTAAGCTCCCCCTTGAGGCAAATATTAAAGTAATTTTTATATGTCTATTATTTCA 
Q TTTAAAGAATATGCTGTGCTAATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAA 
rll ACTTCAAACTTCAAGCAAATGAAATGGACAATGCAGATAAAGTTGTTATCAACACGTCGGGAGTA 
Cf TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
ffj TGTATATTGTATTGAAATTTACAGTGTGCAAAAGTATTTTACCTTTGCATAAGTGTTTGATAAAA 
ATGAACTGTTCTAATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAG 
AAACTTATTACTGTTGTGAACTGAATTCACACACACACAAATATAGTACCATAGAAAAAGTTTGT 
TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
AATAAGAAGCTATTTCATTAAGTGTGATATAAACCTCCTCAAACATTTTACTTAGAG$C^GGAT 
TGTCTAATTTCAATTGTGGAAGACATGTGCCTTATAATTATTTTTAGCTTAAAATTAAACAGATT 
T T G T AAT AAT G T AAC T T T G T T AAT AGG T GC AT AAAC AC T AAT G C AG T C AAT T T GAAC AAAAGAAG 
TGACATACACAATATAAATCATATGTCTTCACACGTTGCCTATATAATGAGAAGCAGCICTCTGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 
ACTATATTAGTATACAAAGAGGTCATGTGGTTGAGACCAGGTGAATAGTCACTATCAGTGTGGAG 
ACAAGCACAGCACACAGACAT T T TAGGAAGGAAAGGAAC TACGAAAT CG TGTGAAAATGGGT T GG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 



MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKH^ 

GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDEPSCGSEKCVVMYHQPTANPGL^ 

PYLTNQPGDTHQNVWTEAGIIPNLIYWIPTIPLLLLILVAFGTCCFQMLHKSKGRTKTSPNQ 
S TLW I SKS TRKE S GME V 



Important features of the protein: 
Signal peptide: 

: amino acids 1-21 

Transmembrane domain : 

a? 

LiJ amino acids 214-235 

If! ... 

r\ N-glycosylation sites. 

%■ amino acids 86-8-9-, 255-258 

■JJJ cAMP- and cGMP-dependent protein kinase phosphorylation site. 

Q amino acids 266-269 

m 

N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-11.4: 140- 
145, 212-217 
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